DEV Community

Joshua M
Joshua M

Posted on

The foundation of web page manipulation and interaction

Content Overview

  • Introduction

  • The web browser

  • The HTML document as an object

  • Javascript – A language for creating and manipulating web pages

  • Why understanding the DOM is important

  • DOM – the standard for accessing web documents

  • Node and Element

  • Methods and Properties

  • Finding, creating, manipulating, styling, and removing elements using HTML DOM methods and properties

  • Conclusion

Prerequisite: You should have a basic understanding of HTML, CSS, and Javascript.

Introduction

The World Wide Web was introduced to simplify the sharing of resources between two or more computers through a connected system, called the internet. These resources could be in the form of text, images, or video. They are usually retrieved and displayed in a human-readable format by a web browser. This write-up provides a succinct overview of the internet browser, and the HTML document, along with the basis for manipulating and interacting with web pages via scripting.

The web browser

A web browser, or browser for short, is an application that retrieves data from a remote or local server and renders it in a form that a user can interact with. The process of parsing and displaying data in a human-readable format is usually done by the browser engine, which is sometimes called a rendering engine. You may check here for different browser engines and comparisons.

The browser receives web contents as byte packets and then converts them into a format it understands. What this means is that the bytes are first converted to characters, then to tokens. Tokens are then converted into nodes. This is simply the process through which the document tree is created. We will talk more about this later in this article.

Process of creating the DOM tree by the browser
How the DOM tree is being created (Image source: web-dev)

For over two decades now, the web browser has been an important tool for retrieving web pages from the internet. We use it for quite many things including shopping, chatting with friends, getting information about the latest trend, sending and retrieving emails, searching for jobs, and so on. For this reason, makers of browsers are always working hard to optimize rendering and security.

The HTML Document as an object

Remember, for a web browser to display web pages, it needs to translate them into a format it can comprehend. This involves converting web content segments into nodes. The initial step in this process involves creating the document node, or object, representing the entire HTML (HyperText Markup Language) document. Consequently, your browser perceives the document as an entity comprised of multiple objects instead of a collection of characters. The image below shows the document object inside the browser's window.

The global window object
The global window object (Image source: medium)

If this so far is not yet clear to you, please take a breath and read again to this point as it lays the foundation upon which this article is built. You may also use the external links for more explanation.

Javascript - a language for creating and manipulating web pages

Javascript is simply a scripting language for creating interactive and dynamic web content. For your script to communicate with the browser and web document, it needs an interface or a standard API called the Document Object Model (DOM). As you have learned above, the browser models the HTML or XML document into an object during parsing which then provides you with an interface through which your script can manipulate the web document.

The good news is that Javascript has evolved over the years, it is now used not only for client code but for the server as well.

Why understanding the DOM is important?

  • It is the standard way by which the browser represents an HTML or XML document for fast rendering of the web page.

  • It allows you to dynamically style, interact, and manipulate web pages through a script. You can hide a navigation element or dropdown and only show it when a user clicks a particular button. That reduces the browser's load time as it dynamically adds an element when it is needed.

  • The DOM transforms a web document into a central object with access to methods and properties.

DOM - the standard for accessing web documents

A web document could be in HTML or XML format. The DOM is a structured representation of the web document in the browser memory after parsing. It is a tree representing the document object.

The World Wide Web Consortium (W3C) is an organization that handles the standardization of the DOM. It separates the DOM standard into three (3) parts, namely:

  • Core DOM - This defines the model for all document types

  • HTML DOM - The model for HTML documents

  • XML DOM - This standard defines the model for XML documents

The primary focus of this article is the HTML DOM.

Node and Element

The term "Node" in this sense is commonly used as a generic name for any object in the DOM tree. A node could be an element, attribute, text, or comment - MDN - nodeType. An element is a node with a specified HTML tag such as <head>, <table>, <p>, etc.

Methods and Properties

Each element in the DOM represents specific data in the web document and responds to some sort of instructions. For you to perform actions such as adding or deleting an element, or even adding an event, you need some sort of command the element responds to.

The HTML DOM provides you with various methods and properties that you can use to manipulate elements. When it comes to working with elements, there are specific actions you can perform using certain methods. On the other hand, properties are values that you can easily obtain or modify as needed. Knowing the distinction between methods and properties is crucial for effectively managing and manipulating elements in your projects. We have quite many of them. However, this article explains those you will most likely be using in your scripts.

Before we go ahead, let's write a sample HTML file we will be using for this tutorial. I added the styling code too just for some nice visual output.

<!DOCTYPE html>
<html lang="en">
  <head>
    <meta charset="UTF-8" />
    <meta name="viewport" content="width=device-width, initial-scale=1.0" />
    <title>Sample HTML Document</title>
    <link rel="stylesheet" href="styles.css" />
  </head>
  <body>
    <div class="container">
      <div class="parent">
        <h1>Sample HTML Document</h1>
        <div class="wrapper">
          <div class="navigation" id="navigation">
            <button class="dom btn" id="dom" data-name="nav-btn">DOM</button>
            <button class="methods btn" id="methods" data-name="nav-btn">Methods</button>
            <button class="properties btn" id="properties" data-name="nav-btn">Properties</button>
          </div>
         <!--Dynamically add text to div element-->
          <div class="content" id="content"></div>
        </div>
    </div>

    <script></script>
  </body>
</html>
Enter fullscreen mode Exit fullscreen mode

You can use the DOM API to manipulate the rendered version of this document. We will see how to do that later in this tutorial.

Here are the styles we have applied:

html,
body {
    margin: 0;
    padding: 0;
    font-family: ui-sans-serif, system-ui, -apple-system, BlinkMacSystemFont, Segoe
            UI, Roboto, Helvetica Neue, Arial, Noto Sans, sans-serif,
        Apple Color Emoji, Segoe UI Emoji, Segoe UI Symbol, Noto Color Emoji;
    scrollbar-width: thin;
    scrollbar-color: #f00;
}
* {
    box-sizing: border-box;
}

.container {
    display: flex;
    justify-content: center;
    align-items: center;
    height: 100vh;
}
.parent {
    height: 200px;
}
.wrapper {
    margin: auto;
    width: 25rem;
    display: grid;
    gap: 2rem;
}
h1 {
    text-align: center;
}
.navigation {
    display: flex;
    gap: 10px;
}
button {
    padding: 8px 15px;
    background-color: #fff;
    border: none;
    outline: none;
    box-shadow: 1px 2px 15px 1px #e0e0e0;
    text-transform: uppercase;
    font-size: 0.92rem;
    font-weight: 500;
    cursor: pointer;
    border-radius: 3px;
    flex-basis: 100%;
}
.content {
    padding: 1px 10px;
    text-align: center;
}

/*Reduce width on devices with a maximum width of 400 pixels*/
@media screen and (max-width: 400px) {
    .wrapper {
        width: 20rem;
    }
}
Enter fullscreen mode Exit fullscreen mode

Now, view the file on your browser, you should see something like,

A web page with a heading and three buttons
Image of a web page with three navigation buttons

Finding, creating, manipulating, styling, and removing elements in the document using HTML DOM methods and properties.

To utilize or modify any element within your script, it is essential to verify its presence within the browser's Document Object Model (DOM). To achieve this, you may use any methods offered by the DOM for that purpose. As previously stated, a method executes a predefined action. Typically, the name of a method provides insights into its function.

Let's say you wanted to retrieve an element, you may use getElementById or querySelector. Either of these two selects an element. The element selected may have children or not. This means you can also access children elements by retrieving their parent and using the children or childNodes property.

  • getElementById

    Finding an element in the browser DOM using this method returns the element with the specified id. If more than one element has the same id, it returns the first in the DOM tree. Also, note that if the element does not exist, getElementById returns null.

    Unlike the querySelector method which returns the generic element type, this method returns a more specific element type.

    Now, let's get the navigation and content element. You may add the following code inside the script tag in the html file or you create a separate file for it and link it to the html file.

    const navs = document.getElementById("navigation"); // get element by id
    const content = document.getElementById("content"); // get element by id
    

    We called the getElementById() method on the document object and specified the id of the element we wanted. Go ahead and log one of those in the console to confirm.

    console.log(navs);
    

Selected node in the DOM using getElementById
Image of selected node using getElementById

You'd want to do more than just retrieve elements in your scripts. Now, assuming you wanted to give the first button of the navigation element a different background color, it is as simple as,

navs.firstElementChild.classList.add("active");
Enter fullscreen mode Exit fullscreen mode
.active {
    background-color: rgb(1, 29, 29) !important;
    color: #fff !important;
}
Enter fullscreen mode Exit fullscreen mode

The preceding code shows that we added a new class to the target element's class list. We also needed to write the class styling code inside our CSS file. Please take note of that. The add() allows us to add a class(es) to an element specified in the script.

firstElementChild is a read-only property that returns the first child element of a specified element.

The classList property returns a token list of the class attribute of an element. CSS class names or tokens of an element are usually separated by a space.

The add method adds one or more class names to an element.

First element of a selected node styled differently
Image of selected element with a different style on its first child. Demo using getElementById, firstElementChild.

  • querySelector

    This method like getElementById returns the first element in the document with a specified CSS selector(s). The selector could be an id, class name, ids, class names, attributes, dataset selector, or even a tag name. Assuming you wanted to select the navigation element by its class name, using querySelector you just need to add a dot (.) before the class name. If you wanted to select by id, you add the hash symbol (#) instead. Like so,

    const navs = document.querySelector(".navigation"); // Select by class name
    const content = document.querySelector("#content"); // Select by id
    const firstButton = document.querySelector(".navigation .dom"); // deep select by class name
    

    Now, let's create a fourth button and append it to the navigation element.

    const fourthButton = document.createElement("button"); // create a new node
    fourthButton.innerText = "Event"; // add inner text
    navs.appendChild(fourthButton); // append child to nav element
    

    We have created a button element, added a text node, and append it to navs.

    The createElement method creates an element node in the browser memory according to a specified tag name.

    The innerText property returns the text content of the element and all its children, without CSS hidden text spacing and tags, except <script> and <style> elements. Read more

    The appendChild method adds new elements to the end of the list of children. It only accepts a node type.

Append a fourth button element to a selected node
Image of selected element with a newly-added child element. Demo using querySelector, appendChild.

  • querySelectorAll

    The querySelectorAll method returns a list of nodes matching the specified selector. These nodes could be text nodes, element nodes, and attribute nodes. When you query a list of elements using querySelectorAll, the returned list can only be accessed by their index number.

    Let's select all button elements with a data attribute of name=nav-btn. Learn more about data attributes. After that, we will set an active class on the button that is currently clicked by simply looping through all button elements and determining if the button is the target element clicked.

    const buttons = document.querySelectorAll("button[data-name=nav-btn]"); // Retrieve all button elements with a data name `nav-btn`
    console.log(buttons);
    
<img src="https://i.imgur.com/nd5ShaL.png" alt="Image of selected nodes using querySelectorAll">
<figcaption><center>Image of selected nodes using **<mark>querySelectorAll</mark>**.</center></figcaption>
Enter fullscreen mode Exit fullscreen mode
const buttonsContainer = document.querySelector(".navigation"); // select parent element
const buttons = document.querySelectorAll("button[data-name=nav-btn]"); // retrieve all elements matching the specified selector
buttonsContainer.addEventListener("click", (e) => {
    // add an event listener on the parent element
    if (e.target.nodeName === "BUTTON") {
        // only execute if target is a button element
        buttons.forEach((btn) => {
            // loop through the array of buttons in the parent element
            // check if the current click is on the button
            if (btn == e.target) {
                // add the active class
                btn.classList.add("active");
            } else btn.classList.remove("active"); // otherwise remove the active class
        });
    } else return;
});
Enter fullscreen mode Exit fullscreen mode

The addEventListener method adds an event to a specified target through a listener. The listener receives a notification whenever an event occurs such as a click.

forEach method iterate over each element of an array and executes the specified function. It returns an undefined value, unlike the map method which returns a new array.

remove is a method which removes the specified class name from DOMTokenList

A gif of manipulating button elements using querySelectorAll


A short output of manipulating button elements using querySelectorAll, addEventListener.

  • getElementsByClassName

    This method retrieves a collection of elements matching the specified class name(s). It returns a type of HTMLCollection, which is a list of HTML elements similar to an array but it's not, because you can't call Array methods like push(), pop(), or join() on it.

    const buttons = getElementsByClassName("btn");
    console.log(buttons);
    

    In the following image, you will see that this method returns a list of elements (HTML Collection). One advantage of using getElementsByClassName over querySelectorAll is the flexibility it offers in accessing elements. While querySelectorAll returns a NodeList that can only be accessed by index, getElementsByClassName allows you to access each element in the returned list either by index or by name. This makes it easier and more convenient to work with the specific elements you need. For instance, the marked portion in the image indicates that the faded items in the HTMLCollection are the named items, so you can access them by their name-like buttons.namedItem('dom').

Image of selected elements using getElementsByClassName
Image of retrieved elements using getElementsByClassName.

Again, we will add a click event to each of the buttons and change the text content of the text element.

const buttonsContainer = document.querySelector(".navigation"); // select parent element
const buttons = document.getElementsByClassName("btn"); // retrieve all elements matching the specified class name
const textElement = document.getElementById("content");
buttonContainer.addEventListener("click", (e) => {
    // add an event listener on the parent element
    if (e.target.nodeName === "BUTTON") {
        // only execute if target is a button element
        Array.from(buttons).forEach((btn) => {
            // loop through the array of buttons in the parent element
            // check if the current click is on the button
            if (btn == e.target) {
                // add the active class
                btn.classList.add("active");
                textElement.innerText = `This is the ${e.target.innerText} button`;
            } else btn.classList.remove("active"); // otherwise remove the active class
        });
    } else return;
});
Enter fullscreen mode Exit fullscreen mode

Both getElementsByClassName and getElementsByTagName are useful methods that return an HTML Collection of elements. It's important to note that the list they return is not an array, but rather an array-like or iterable object. So if you want to iterate over the list, you need to convert it into an actual array. Thankfully, Array.from comes to our rescue by allowing us to easily achieve this conversion.

The Array.from method returns an array from an array-like object or an iterable object. It is used to convert an iterable object to an array.

A short gif of manipulating button elements using getElementsByClassName


A short gif of manipulating button elements using getElementsByClassName, addEventListener.

  • getElementsByTagName

    getElementsByTagName retrieves elements by a given tag name. It accepts only one tag name which must be a string. Similar to getElementsByClassName, it returns a list of elements. You can return all elements in the HTML DOM by supplying the * as the tag name, e.g.getElementsByTagName('*').

    const buttons = getElementsByTagName("*");
    console.log(buttons);
    
<img src="https://i.imgur.com/k0rUl5i.png" alt="Image of selected elements using getElementsByTagName">
<figcaption><center>Image of retrieved elements using **<mark>getElementsByTagName</mark>**.</center></figcaption>
Enter fullscreen mode Exit fullscreen mode

Now, let's remove the last button element from the DOM.

function renderRemoveButtonAndExecute() {
    const arrayOfButtons = document.getElementsByTagName("button");
    const lastElement = arrayOfButtons.item(arrayOfButtons.length - 1);
    const textElement = document.getElementById("content");
    textElement.innerHTML = `<button class="remove-btn">Remove last button</button>`;
    textElement.children[0].addEventListener("click", () =>
        lastElement.remove()
    );
}
renderRemoveButtonAndExecute();
Enter fullscreen mode Exit fullscreen mode

Above, we wrote a function called renderRemoveButtonAndExecute. It retrieves an array of navigation buttons, gets the last button from the array, and removes it when the remove button is clicked.

The item method returns an element with the specified index in an HTMLCollection.

The innerHTML property allows you to set or retrieve the HTML content of an element. See here for reference.

A gif showing how an element was removed from the DOM using getElementsByTagName


A gif showing how an element was removed from the DOM -- getElementsByTagName, remove.

Conclusion

The browser is an incredibly powerful tool equipped with a wide array of APIs. From the HTML DOM to the File API, Web Audio API, Navigation API, and WebRTC, among many others, these APIs enhance the functionality and versatility of the browser. With such capabilities at your fingertips, you can create dynamic web experiences that push boundaries and meet the needs of modern users. For anyone starting in web development, having a solid grasp of the HTML Document Object Model (DOM) is essential. It forms the foundation of web page manipulation and interaction. There are numerous methods and properties within the DOM that enable developers to efficiently work with HTML elements. Although this article only covered a handful of them, it aimed to give beginners a taste of what the DOM has to offer.

Now you've come to the end of this tutorial. I hope this article has added to your knowledge.

Top comments (0)