DEV Community

Discussion on: New season, new project: I need you 👉 🙏

Savas Vedova

Sure! What I mean is that the API could be written so that it stores the data it scrapes, and after a while people could use it just to query that data. Imagine I scrape the average market price for a car model and specify several websites to search. The next user could simply use that data without having to rewrite the same scraper.
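
To make the idea concrete, here is a rough sketch of what I have in mind. Everything in it is made up for illustration: `SharedScrapeCache`, `fake_scraper`, and the `site-a.example` / `site-b.example` hosts are hypothetical, and a real service would keep the results in a database instead of an in-memory dict.

```python
import statistics
from typing import Callable, Dict, List, Tuple

# Hypothetical scraper signature: (car model, site) -> list of prices found there.
Scraper = Callable[[str, str], List[float]]


class SharedScrapeCache:
    """Keeps scraped prices so later users can query them without re-scraping."""

    def __init__(self, scraper: Scraper) -> None:
        self._scraper = scraper
        self._prices: Dict[Tuple[str, str], List[float]] = {}
        self.scrape_calls = 0  # counts how often the scraper actually ran

    def average_price(self, model: str, sites: List[str]) -> float:
        all_prices: List[float] = []
        for site in sites:
            key = (model, site)
            if key not in self._prices:
                # First request for this (model, site): scrape once and store.
                self.scrape_calls += 1
                self._prices[key] = self._scraper(model, site)
            all_prices.extend(self._prices[key])
        return statistics.mean(all_prices)


def fake_scraper(model: str, site: str) -> List[float]:
    # Stand-in for a real scraper (assumption): returns canned prices per site.
    canned = {
        "site-a.example": [10000.0, 12000.0],
        "site-b.example": [11000.0],
    }
    return canned[site]


cache = SharedScrapeCache(fake_scraper)
sites = ["site-a.example", "site-b.example"]
first = cache.average_price("model-x", sites)   # triggers two scrapes
second = cache.average_price("model-x", sites)  # served from stored data
```

The second call never touches the scraper, which is the part I think would save people time.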

I am saying this because I was using Puppeteer and it was very easy to use; I didn't see the need for another tool while I was using it. The hard part (in terms of effort) was writing the scraper and saving the data to the database. It just takes time.

Pierre

Oh, I see, it makes sense now.

However, ScrapingNinja only returns raw HTML, not formatted data. Maybe that is not clear enough on the landing page.

What you are describing is a product that takes a URL as input and outputs formatted data, so you don't have to configure XPath expressions, CSS selectors, or anything like that.
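
To illustrate the gap between the two, here is a minimal sketch of the step a user still has to write when all they get back is raw HTML. The markup, the `price` class name, and the `PriceExtractor` helper are assumptions made up for this example; this is not ScrapingNinja code.

```python
from html.parser import HTMLParser
from typing import List


class PriceExtractor(HTMLParser):
    """Collects the text of elements whose class attribute is exactly 'price'."""

    def __init__(self) -> None:
        super().__init__()
        self._in_price = False
        self.prices: List[str] = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; the class name is hypothetical.
        if dict(attrs).get("class") == "price":
            self._in_price = True

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current price element.
        self._in_price = False

    def handle_data(self, data):
        if self._in_price and data.strip():
            self.prices.append(data.strip())


# Raw HTML as a scraping API might return it (made-up sample).
raw_html = (
    '<div><span class="price">$11,000</span>'
    '<span class="name">Model X</span></div>'
)

extractor = PriceExtractor()
extractor.feed(raw_html)
formatted = {"prices": extractor.prices}
```

A product like the one you describe would have to do this extraction for the user, for any site, without them naming the selector.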

Thank you very much for your time and feedback.