Hello everyone parsing sites is fun, you can use this data for your resource or do it to order. And what if you download, for example, all the messages from a telegram chat or a list of its participants, and then use this data for analytics, or even better, to form your database of users who are interested in this or that topic.
In this series of articles, we will look at how to write our own separate Telegram client that will collect data from the chats we are interested in, and also see how to save this information to our database.
To create a separate client, the asynchronous library "Telethon" is well suited. The library itself can be used both to create telegram bots and to create separate applications working with the Telegram API. The main advantage is clear documentation in which you can find answers to all questions.
We will start creating our project by registering a developer account here
We enter the numeric-letter code that came to Telegram and get to the registration page of the new application. Fill out the form, the first two fields are enough:
If everything is entered correctly, you will see the following information.
I'll make a reservation right away, there will be a little more data, but the App api_id and App api_hash parameters are important to us.
It's not worth closing the page yet. We will take from there the values of App app_id, App api_hash, Short_name for our application.
It would be a good tone not to store our hash and app_id in the code, so let's do it nicely. Use the configparser library to create a settings file. Create a file with the .ini extension in the root of the project (example config.ini) and let's put our data from the developer account that we have registered there.
I know about venv and environment variables. You can use a method that is convenient for you.
Next, the most interesting thing awaits us. Let's install the Telethon library itself in our project with the "pip install telethon" command and import the TelegramClient class from our installed library into the project.
import configparser from telethon import TelegramClient
Next, let's configure the transfer of our data to the connection from the settings file
config = configparser.ConfigParser() config.read("config.ini") api_id: str = config['Telegram']['api_id'] api_hash = config['Telegram']['api_hash'] username = config['Telegram']['username'] client = TelegramClient(username, api_id, api_hash) client.start()
Please note that in the "config.ini" file, we specified [Telegram] as the first line. With these tags, we will simply not get confused about the settings variables and separate them in one file.
Let's create our main function and request information about us from the telegram server.
async def main(): about_me = await client.get_entity('me') print(about_me)
Telethon is asynchronous, which means we will use functions and methods with the addition of the keywords async and await
In order for our client not to close after launch, we will add such an entry at the end of our file
with client: client.loop.run_until_complete(main())
Our app is almost ready for the first launch
This is necessary to create a session file, it will be stored in the root of the project with the extension .session (it's not worth deleting them, we'll talk about them later)
Enter your phone number in international format without "+"
You have received the telegram account code again, only now it consists of 5 digits. Enter them.
Congratulations you have launched your Telegram client.
So what did our main function return to us?
Our "about_me" variable now contains a User object with the telethon library data type.
Inside the object you can see data about your account.
In order to see some information, let's make this code:
async def main(): about_me = await client.get_entity('me') print('Name:', about_me.first_name) print('Nickname:', about_me.username) print('Id', about_me.id) print('Number', about_me.phone)
I hope you liked the result? =)
The Telegram server has just told you a little more about you than you see in your account.
Data leakage from Telegram is a Telegram problem.
All information that we will receive is publicly available. And we can see the same information through the official app. But I still urge you not to use the acquired knowledge for bad purposes. All the information presented is provided for the purpose of education and popularization of the programming language, its libraries and capabilities.
If you, dear reader, show your activity to my work, then in the following articles we will look at a way to get all messages from the telegram chat, as well as all participants of the chat or group, we will study ways to get the user's phone number even if it is hidden and learn how to find information about people in Telegram by their phone number.
Unfortunately English is not my native language, but I am working on it. I will be waiting for your suggestions, feedback and comments.
By tradition, the full code:
import configparser from telethon import TelegramClient config = configparser.ConfigParser() config.read("config.ini") api_id: str = config['Telegram']['api_id'] api_hash = config['Telegram']['api_hash'] username = config['Telegram']['username'] client = TelegramClient(username, api_id, api_hash) client.start() async def main(): about_me = await client.get_entity('me') print('Name:', about_me.first_name) print('Nickname:', about_me.username) print('Id', about_me.id) print('Phone', about_me.phone) with client: client.loop.run_until_complete(main())