XPath vs CSS Selector: what is going on with these two?

#scraping #xpath

As I was learning scraping, I worked through many tutorials. Some used XPath and others used good ol' CSS selectors. Until today I always resorted to CSS because it was the familiar choice. All I knew about XPath was that Scrapy uses it by default and that any CSS selectors are converted to XPath behind the scenes. Being a complete noob at the time, I did not give it much thought. Alas, the time has come for me to sink my teeth into this topic and understand the difference between these two types of selectors.

XPath

What it is

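XPath stands for XML Path Language. It is a query language that treats an XML (or HTML) document as a tree and identifies elements within it. The path part of XPath means that we describe a path through that tree to the desired element, either step by step from the root or, with //, from anywhere in the document.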
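
To make the "path" idea concrete, here is a minimal sketch using Python's built-in xml.etree.ElementTree, which supports a limited subset of XPath. The tiny document is made up for illustration, and a real scraper would more likely use Scrapy or lxml.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed document standing in for a page.
DOC = "<html><body><div><p>Hello</p></div></body></html>"
root = ET.fromstring(DOC)  # root is the <html> element

# Step-by-step path from the root element down to the paragraph.
full_path = root.find("./body/div/p")

# Shortcut: .// searches all descendants, so intermediate steps can be skipped.
shortcut = root.find(".//p")

print(full_path.text, shortcut.text)
```

Both expressions land on the same element; the second one just does not care what sits between the root and the paragraph.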

Advantages of XPath

Allows navigation up the DOM when looking for elements
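More flexible than CSS selectors: it can filter on position, text content, and conditions combined with and/or
Allows searching for full or partial text in an element's content or attribute values with the contains() function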
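
Navigating up the tree and matching on text can be sketched with the standard library as well. This is only an approximation under stated assumptions: ElementTree implements a subset of XPath and has no contains(), so the partial match falls back to plain Python, and the page snippet with its ids is invented for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical snippet standing in for a scraped product page.
PAGE = """
<html>
  <body>
    <div class="product" id="p1">
      <h2>Blue Widget</h2>
      <span class="price">9.99</span>
    </div>
    <div class="product" id="p2">
      <h2>Red Widget</h2>
      <span class="price">19.99</span>
    </div>
  </body>
</html>
"""

root = ET.fromstring(PAGE)

# Navigate *up*: find the heading with this exact text, then step to its parent.
parent = root.find(".//h2[.='Red Widget']/..")
parent_id = parent.get("id")

# Full text match on a child: the div whose <h2> child has exactly this text.
blue = root.find(".//div[h2='Blue Widget']")
blue_price = blue.find("span[@class='price']").text

# ElementTree lacks contains(), so the partial match is done in Python here.
# In a full XPath 1.0 engine this would be //h2[contains(text(), 'Widget')].
partial = [h2.text for h2 in root.iter("h2") if "Widget" in h2.text]

print(parent_id, blue_price, partial)
```

The `/..` step at the end of the first expression is the part that CSS selectors have no direct equivalent for.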

CSS Selector

What it is

A CSS selector uses the same pattern syntax that Cascading Style Sheets (CSS) use to target elements for styling. Most web pages are styled with CSS, so the syntax is already familiar to a lot of people, and that makes CSS selectors a popular choice.

CSS relies on tags, class names, and ids, among other things, to select what we want. This is in contrast with XPath, which describes a path through the document tree to reach the element.

Advantages of CSS Selector

Simplicity because CSS is easy to pick up
Often cited as faster than XPath, particularly in browsers, whose CSS engines are heavily optimized; in Scrapy the difference largely disappears because CSS selectors are converted to XPath anyway
Allows selecting elements by attribute and attribute value, for example a[href] or input[type="checkbox"]
Allows pseudo-classes for elements in a particular state, such as :hover for hovered elements and :checked for checkboxes

Syntax for XPath and CSS Selector

This is a great table that shows the differences between XPath and CSS Selector, made by Slotix:
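
As a rough companion to that comparison (not a reproduction of Slotix's table), here are a few common CSS selectors next to hand-written XPath equivalents. Python's standard library has no CSS engine, so only the XPath side is actually run, using ElementTree's XPath subset; the page snippet and the chosen pairs are my own assumptions for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical snippet standing in for a scraped page.
PAGE = """
<html>
  <body>
    <div id="main">
      <ul>
        <li><a href="/one">One</a></li>
        <li><a>Two</a></li>
      </ul>
      <span class="price">9.99</span>
    </div>
  </body>
</html>
"""
root = ET.fromstring(PAGE)

# (CSS selector, roughly equivalent XPath). The class example only holds
# when the element has exactly one class; full XPath engines usually use
# contains(concat(' ', normalize-space(@class), ' '), ' price ') instead.
EQUIVALENTS = [
    ("li", ".//li"),                             # by tag
    ("#main", ".//*[@id='main']"),               # by id
    ("span.price", ".//span[@class='price']"),   # by class
    ("a[href]", ".//a[@href]"),                  # attribute present
    ("ul > li", ".//ul/li"),                     # direct child
    ("div a", ".//div//a"),                      # any descendant
]

# Run only the XPath side and count the matches for each pair.
counts = {css: len(root.findall(xpath)) for css, xpath in EQUIVALENTS}
for css, xpath in EQUIVALENTS:
    print(f"{css:<12} {xpath:<28} matches: {counts[css]}")
```

The CSS column is consistently shorter, which is a big part of why it feels easier to pick up.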
