Browsed by
Tag: scraping

PhantomJS scraping on word translation example

PhantomJS scraping on word translation example

In the previous article, we’ve shown an example of web scraping using Jsoup. You may be interested in how it is different from PhantomJS scraping. Jsoup parses HTML content that is available on a page load. In most cases, it’s enough, but in some cases, modern websites have content that is loaded dynamically via JavaScript. Hence Jsoup may receive just an empty page without loaded data (e.g. products). In such cases, headless browsers are a way to go and PhantomJS…

Read More Read More

Web scraping using Jsoup (Java)

Web scraping using Jsoup (Java)

Web scraping is data extraction from websites and Jsoup is quite a popular tool to do it in a convenient way. It is is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents. There are lots of use-cases. For example, you may be looking for a new apartment to rent on a website or monitoring discounts on an e-commerce store. If the website does not have a feature to subscribe to newly added records,…

Read More Read More