scraping - Keep code clean

PhantomJS scraping on word translation example

May 7, 2020 Serhii Beliablia Comments 0 Comment

In the previous article, we’ve shown an example of web scraping using Jsoup. You may be interested in how it is different from PhantomJS scraping. Jsoup parses HTML content that is available on a page load. In most cases, it’s enough, but in some cases, modern websites have content that is loaded dynamically via JavaScript. Hence Jsoup may receive just an empty page without loaded data (e.g. products). In such cases, headless browsers are a way to go and PhantomJS…

Web scraping using Jsoup (Java)

May 4, 2020 Serhii Beliablia Comments 2 comments

Web scraping is data extraction from websites and Jsoup is quite a popular tool to do it in a convenient way. It is is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents. There are lots of use-cases. For example, you may be looking for a new apartment to rent on a website or monitoring discounts on an e-commerce store. If the website does not have a feature to subscribe to newly added records,…

Keep code clean

Development blog to share experience

Browsed by
Tag: scraping

PhantomJS scraping on word translation example

May 7, 2020 Serhii Beliablia Comments 0 Comment

Web scraping using Jsoup (Java)

May 4, 2020 Serhii Beliablia Comments 2 comments