I have created my own search engine and I'll tell you how it works.
First of all, I created my own web crawler. It's written in Python and stores various metadata about each page in a MySQL database.
SchBenedikt / web-crawler
A simple web crawler using Python that stores the metadata of each web page in a database.
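To give an idea of how such a crawler can work, here is a minimal sketch (not the repo's actual code; the `pages` table and its columns are assumptions for illustration):

```python
# Minimal crawler sketch (illustrative; table and column names are assumptions).
import requests
from bs4 import BeautifulSoup
import mysql.connector

def crawl(url):
    # Fetch the page and parse its HTML.
    html = requests.get(url, timeout=10).text
    soup = BeautifulSoup(html, "html.parser")

    # Extract basic metadata: title and meta description.
    title = soup.title.string.strip() if soup.title and soup.title.string else ""
    desc_tag = soup.find("meta", attrs={"name": "description"})
    description = desc_tag["content"].strip() if desc_tag and desc_tag.get("content") else ""

    # Store the metadata in MySQL (hypothetical schema: pages(url, title, description)).
    conn = mysql.connector.connect(host="localhost", user="crawler",
                                   password="secret", database="search")
    cur = conn.cursor()
    cur.execute(
        "INSERT INTO pages (url, title, description) VALUES (%s, %s, %s)",
        (url, title, description),
    )
    conn.commit()
    conn.close()

crawl("https://example.com")
```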
The search engine, which I built with Bootstrap, then retrieves the results from MySQL. The bottom right always shows how long the query took.
SchBenedikt / search-engine
The matching search engine to my web crawler.
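As a rough sketch of how the MySQL lookup and the displayed query time could fit together (the route name and schema are again assumptions, not the repo's code):

```python
# Search endpoint sketch (illustrative; schema and route are assumptions).
import time
import mysql.connector
from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route("/search")
def search():
    query = request.args.get("q", "")
    start = time.perf_counter()  # start the timer for the displayed query time

    conn = mysql.connector.connect(host="localhost", user="search",
                                   password="secret", database="search")
    cur = conn.cursor(dictionary=True)
    # Simple substring match on title/description (hypothetical pages table).
    like = f"%{query}%"
    cur.execute(
        "SELECT url, title, description FROM pages "
        "WHERE title LIKE %s OR description LIKE %s",
        (like, like),
    )
    results = cur.fetchall()
    conn.close()

    elapsed_ms = (time.perf_counter() - start) * 1000
    # The frontend can show elapsed_ms in the bottom right corner.
    return jsonify({"results": results, "took_ms": round(elapsed_ms, 1)})

if __name__ == "__main__":
    app.run()
```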
https://hub.docker.com/r/schbenedikt/schaechnersearch
Features
- Liking pages (the most-liked pages are displayed at the top; see the sketch after this list)
- Display of the search speed.
- Ask AI for help.
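The like-based ranking from the list above could be as simple as sorting by a like counter. A minimal sketch, assuming a `likes` column on the same hypothetical `pages` table:

```python
# Ranking sketch: most-liked pages first (assumes a likes column on pages).
def ranked_results(cur, query):
    like = f"%{query}%"
    cur.execute(
        "SELECT url, title, description, likes FROM pages "
        "WHERE title LIKE %s OR description LIKE %s "
        "ORDER BY likes DESC",  # most-liked pages come first
        (like, like),
    )
    return cur.fetchall()
```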
Please note that robots.txt files are not currently respected, which is why you shouldn't simply crawl every page.
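For anyone who wants to add robots.txt support themselves, Python's standard library already handles the parsing; here is a small sketch of what the check could look like (an assumption about how it could be wired in, not part of the project yet):

```python
# Sketch: checking robots.txt before crawling (not yet part of the project).
from urllib.robotparser import RobotFileParser
from urllib.parse import urljoin

def allowed(url, user_agent="SchaechnerSearchBot"):
    rp = RobotFileParser()
    rp.set_url(urljoin(url, "/robots.txt"))  # robots.txt lives at the site root
    rp.read()
    return rp.can_fetch(user_agent, url)

if allowed("https://example.com/some/page"):
    print("OK to crawl")
```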
What do you think of the project?
Feel free to let me know in the comments!
Top comments (7)
Cool 👍👍
Pretty daunting task to create a search engine, but I welcome the effort and wish you the best.
Any reasons not to use a NoSQL or a Graph database for this kind of project? Not criticizing your work, just curious.
I don't have any experience with the other database systems yet, so it's easiest to use this one. Do you have any other recommendations?
I think you might like MongoDB, plus it is a great fit for fast read/write access and you get to host your database for free on MongoAtlas in the cloud (their free tier is very generous).
Good luck for your project!
Since I have my own server and domain, I am not currently looking for a free provider. If MongoDB is really faster, I will definitely come back to it. At the moment, however, I would like to improve the search algorithm.
How many rows of data have you stored so far?
It's about 44,000, but I'm not going to expand the list for now because the project is still under development.