Interacting with a Web Page using the Browser Console: A Simple Example

Daily short news for you

Not to be outdone, Google recently introduced Gemini CLI - a type of AI agent similar to Codex or Claude Code.

What’s notable is that they offer up to... 1000 queries for free each day. That’s quite a lot. Additionally, they have open-sourced this project to ensure transparency, learning, and research 🤓

» Read more
There’s another tool to help quickly search command history: atuinsh/atuin.

What’s interesting is that it uses SQLite for storage. It also provides the feature of completely synchronizing (encrypting) history between machines. How cool is that 🤓

» Read more
I am really impressed with the gemma-3n-E4B model from Google. This is one of the promising models that aims to bring large language models to run on mobile devices or the web or embedded...

It feels like it understands the prompts better, because I’ve tried many low-parameter models that often overlook my prompts. For example, when I say, "Just return the answer, no need for further explanation," many still tend to add introductory phrases or explanations... but with gemma-3n, the responses are very much to the point.

» Read more

The Issue

You may already know that before React, Vue, etc., jQuery dominated the web platform for a long time. Almost every website had to use jQuery because it was a library with many powerful DOM interaction functions. Many libraries were built based on jQuery, and some even required jQuery before use. However, due to its heaviness and bulkiness, along with the excessive abuse of jQuery, it has become increasingly disliked. Coupled with the emergence of many modern web development tools, it is now on the "brink of death."

When accessing a website, every action you take, such as mouse movements, clicks, key presses, etc., reflects an interaction with the DOM. The DOM can be simply understood as the HTML components displayed on the webpage. Additionally, there are a few other things that can interact with the DOM, such as JavaScript. This means that JavaScript can be used to create interactions with the DOM instead of the user. From there, we can automate a series of actions without the need for human interaction.

Most modern browsers provide a feature called Console for writing JavaScript code directly on the website being accessed. That is also the content of today's article; I will take a specific example of how to use JavaScript to perform an action on a website. I hope that through this article, readers can create more practical applications in reality and apply them in their daily work.

DOM and JavaScript

Document Object Model (DOM) is a programming interface for web pages. It represents the webpage so that programs can change the structure, style, and content of the document. The DOM represents the web page (document) as nodes and objects. Thus, it allows some programming languages to interact with the page.

JavaScript can interact with the DOM. Imagine the DOM as a Web API that the browser provides for JavaScript to have full access and editing capabilities. Almost every great experience on the web today features JavaScript. Many frameworks and libraries have emerged based on JavaScript that enhance the ability to interact with the DOM, providing convenience and friendliness to developers.

The browser allows us to run JavaScript code directly on the webpage through Dev Tools. For example, in Chrome, after enabling Dev Tools and entering the Console window, you can write JavaScript code here and run it.

Taking advantage of this, many cyber attackers previously tried to trick users into pasting a snippet of code into Dev Tools to steal user data. For an average user who cannot fully understand the dangers of executing JavaScript code right on the website they are using, this can be risky. However, in this article, I will not delve into that topic; the goal is to help you visualize how to use JavaScript to interact with the DOM.

The theory may be a bit dry, so let's take a specific example of "Continuously Reacting to Messages on the Web Telegram."

Continuously Reacting to Messages on the Web Telegram

In addition to its cross-platform application, Telegram also allows access through the web. One thing I find interesting is the ability to react to messages with emoticons. I like it because often I don't need to reply but just want to drop an icon, or sometimes to let the sender know that I have read their message and how I feel after reading it.

I have a habit of frequently reacting to messages. Partly to let them know that I have read it, partly for "playfulness." My Telegram has a group of friends who often chat about various topics, and to alleviate boredom, I once wrote a piece of code to automatically react to every message that our group sends.

To write the code, we first need to spend some time researching how the reaction feature works.

First, whenever you hover over the timestamp, the icons appear, and clicking on any icon successfully reacts. Other messages are similar. Notice that the DOM containing the icon elements is always there; it is just hidden using CSS and only appears when you hover over.

So the idea is to write a piece of code that retrieves all the DOM elements containing messages, iterates through each one, and clicks on the icon to successfully react.

First, the function to get all messages:

const getListMessage = () => {
  return (
    document.querySelectorAll(
      "div.message-content-wrapper.can-select-text > div"
    ) || []
  );
};

div.message-content-wrapper.can-select-text > div is the selector for each message.

The function checks if a message has been reacted to by checking if the .Reactions element exists.

const checkReactMessage = (elm) => {
  if (elm) {
    return !!elm.querySelector(".Reactions");
  }
  return false;
};

The function that reacts to a message by click:

const react = (elm) => {
  if (elm) {
    elm.click();
  }
};

A main processing function:

const main = () => {
  const listMessage = getListMessage();
  listMessage.forEach((elm) => {
    if (!checkReactMessage(elm)) {
      react(elm.querySelector('div.quick-reaction'));
    }
  });
};

Finally, call the main function and see the result. If you want it to continue reacting to new messages afterward, add the setInterval function to repeat the action periodically.

setInterval(main, 3000);

The result.

Conclusion

The DOM is a programming interface for web documents. JavaScript can easily interact with the DOM. Particularly, the ability to run JavaScript directly from the Console in the browser allows you to comfortably create automated actions on behalf of the user. The above article is a specific example of how JavaScript can continuously react to messages on the web Telegram. Additionally, readers can do the same on other websites they desire. If you have any new ideas, please leave a comment for me and everyone to know. Thank you.

Premium

5 profound lessons

Every product comes with stories. The success of others is an inspiration for many to follow. 5 lessons learned have changed me forever. How about you? Click now!

View all

Interacting with a Web Page using the Browser Console: A Simple Example

The Issue

DOM and JavaScript

Continuously Reacting to Messages on the Web Telegram

Conclusion

Upgrade to Premium

Premium

Premium Plus

Premium