proflupin.solᵍᵐ 🐍

Posted on Oct 12, 2022

A dummy's guide to Solana's architecture

#solana #web3 #blockchain #nfts

Introduction

This tutorial is meant to be for a spectrum of readers:

Those experienced in the EVM space
Those new to web3
Non-developers

We will be going through some of the core concepts in the Solana architecture. These are basic building blocks for every action that happens on this chain. The concepts we are going to discuss are not only beneficial to understand for aspiring developers but also for end users to make the most out of the experience of fast and cheap transactions on Solana.

The Block Explorer: a window to the chain

Block explorers are online interfaces which provide the user a chance to search, and retrieve data points for past and on-going transactions, wallet addresses and other useful info on a given chain. This provides a dimensionality of transparency between the chain and the users.

There are several pretty decent block explorers on Solana:

Throughout the tutorial, we'll see examples right from the block explorers that juxtapose the theory we discuss. Here are some of the words you'll see often throughout the tutorial:

Blocks, Validators and Proof of History!
Accounts
Instructions and Transactions
Programs
Token Program
Token Metadata Program
Program Derived Addresses (PDA)
SPL-tokens, Fungible Assets and Non-Fungible Tokens (NFTs)

Let's start from the very top. What the hell even is a blockchain?

Blocks, Validators and Proof of History:

Without going into the depths of the jargon ocean, a blockchain is simply a digitally distributed database. The key difference between a typical database and a blockchain is how the stored data is structured.

A blockchain stores and manages data into small groups known as Blocks. A block usually stores important information like input and output hashes, number of transactions, and the event it contains. When a block is filled with data or "created", by a blockchain node, it is linked to the previous blocks. This forms a chain of blocks, or a blockchain.

In the case of Solana, when a node claims to have created a block, the other nodes strive to verify this claim. The node that generated to block is known as the Leader and the nodes that verify the claim are known as Validators.

How does this validation, or consensus work on Solana? Proof of History (PoH)

This is exactly what makes Solana unique. Its extremely easy, fast and cheap-to-use nature allows itself to be named the world's first web-scale blockchain. The underlying technology behind Solana is the novel consensus mechanism known as Proof of History.

"Proof of History is exactly what the name suggests: a proof of historical events", Anatoly Yakovenko.

PoH is a SHA-256 hashed, verifiable delay function (VDF): this function guarantees the following: when the output hash is validated for an input hash, it is by the nature of PoH that we can confirm that some time has passed since we provided the input. This very nature of the VDFs insures the super fast speeds of the Solana blockchain.

Coming back to how Validators prove that the Leader node is the node that generated the block: the Validators run the above PoH VDF on a number of transactions existing on the block to be verified. The outputs of the VDF are then compared with the outputs provided by the Leader node. Once proven, the Leader is then confirmed to have created the block and rewards for validation are distributed to all the validators, including the Leader.

In the above view of a given block #154616344, we can see a variety of information.

Block Hash: The SHA-256 hash (reference number) for a given block on chain.
Leader: The block hash of the node that created the given block.
Previous Block Hash: The parent block's # of the given block.
Reward: The total rewards distributed to the Validators.
Transactions: The total number of transactions (successful + unsuccessful) executed in the given block.

Now, if you click on one of the hashes in the By column of the Transactions section, we land on a page which gives a detailed overview of the Account with hash GFDmfhoPtZZATNxw8zyZPVYKkBZutf1zoZTbRbxDjh5E.

Let us first go through some theory and after which we can truly understand the account page and all its components.

Accounts

Solana's Account model is very unique and is partially responsible for why its so damn fun to build apps on it.

On any blockchain, decentralised apps need to track the current state: number of tokens, NFTs transferred, current highest bidder in an auction, orders for a token on a DEx etc. These states need to be saved in data storing units on-chain, called accounts.

On EVM, smart contracts are executable pieces of code or programs. They are accounts themselves and store their own data. The code inside the smart contract defines how the data within the account is going to be modified.

On Solana, Accounts and on-chain programs interact differently. On-chain executable programs are stored in completely immutable accounts, whereas data that the executable programs interact with is stored in mutable accounts.

All accounts on Solana are assigned a Program as the owner, and only these owners are allowed to make changes to the data (or debit Solana) from a given account. Important to note here: anyone can credit funds permissionless-ly to an account.

A developer coming from the EVM space might find this state-less Account Model to be a little confusing and complex. But this model allows something which is not possible in other chains: Transaction parallelisation. We are going to talk more about this in the following section.

On Solscan's page for account GFDmfhoPtZZATNxw8zyZPVYKkBZutf1zoZTbRbxDjh5E, we can see that the account holds some SOL and has some tokens. We'll discuss what tokens are later. But for now lets check out the section for Transactions. If you click on any one of the Signatures of these transactions, we'll land on a new page which shows some Transaction details. But before we dive deep into this, lets understand what transactions are.

Instructions and Transactions

Instructions are the basic operational units on Solana. Instructions are commands that can be bundled up together to form Transactions. These transactions can then be executed by a Program to read and/or write data from one or more Accounts. The instructions that form the transaction are executed atomically, or one-by-one in the order in which they were added to form the transaction. If one instruction fails, the entire transaction is invalidated.

An instruction takes the following data:

program_id of the Program which owns the accounts who's states are going to change
Array of all accounts the instruction intends to read/write data from.
instruction_data array which specifies more metadata intended for the program program_id.

One of more of such instructions are then bundled to form the transaction which needs to specify the following data before it can be executed:

Array of all accounts it intends to read/write data from.
The array of all instructions the transaction wants to specify in the very same order the user wants to execute them
A recent blockhash: this is required to prevent duplication and stale transactions. The max age of transaction is 150 blocks.
signatures of all accounts the given transaction intends to debit from or to write data into. This signature array needs to be in the same order of the corresponding accounts in the accounts array.

While we are talking about transactions, its important to also talk about the Transactions per Second (TPS) of the network. Solana is famed to have one of the highest TPS on any chain.

There are two types of transactions Vote and Non-Vote transactions. Vote transactions are used to do the network validation as described in the Proof of History section, and hence are not the "true" transactions being utilised for doing the actual work on chain like transferring NFTs / tokens, bidding, selling etc. The graph above shows that in the last 7 days, number of Vote transactions are almost 7 times as much as the Non-Vote transactions. This plays into the true TPS (Transactions per Second) debate of Solana: although Solana is blazingly fast with reported TPS touching averaging >3k, the true TPS (removing transactions for Voting) is <1.5K.

Now that we know what transactions and instructions are, we ought to understand who actually triggers/interacts with them. The answer is Programs.

Programs

We have mentioned Programs quite a lot of times in this tutorial. But what exactly are Programs?

You might have heard of Smart Contracts in other chains like EVM. The equivalent to the smart contracts on Solana are Programs. Programs are nothing but executable code that is stored in completely immutable Accounts marked as executable.

The way Programs differ from Smart Contracts is that Programs on Solana are state-less, ie, they do not store the data that they are processing / acting on. Data and code that processes the data are stored in different accounts. This allows the processing of data in a parallel fashion: multiple programs can reuse the same data in specific accounts for different purposes, in different transactions.

You can imagine this parallelisation to be something similar to how functions in programming work: Variables can be defined to store some data. These variables can then be passed as reference to multiple instances of the same function or different functions asynchronously to make changes to these variables. In this analogy, the variables are Accounts and the functions are Programs.

Coming back to the Solscan page for a transaction with signature 3bsfEaCdbjcBvXQT3Fx85R2TvKL3KJe5kb4G8XYqUFXCBxKzemu6FV3fi2C51TBtcKpjygCfzdXsjg9dTjVxpEo5, we can start to understand what its trying to tell us.

In the transaction overview, we can checkout the block in which the transaction was executed, the fee to run the transaction, whether the transaction was successful or not and most importantly, the main actions that the transaction executes. In this case, there was a transfer of 253 SOL from wallet GFDmfhoPtZZATNxw8zyZPVYKkBZutf1zoZTbRbxDjh5E to 4tzAkc7cGTns1dsVaP18KvMkt9jEmzA1AaV3rMzgyi7f.

We can also see the same on the instruction detail. In the given transaction, only 1 instruction was executed. As it was a simple SOL transfer, the program which interacts with the transaction is the System Program, with hash: 11111111111111111111111111111111.

If we go back to the Account page on Solscan, we can see a section for Token Accounts, and also see some SPL-tokens listed. Let us see what these are.

SPL-tokens, NFTs and the Token Program

Like any other blockchain, Solana has tokens as well. These are assets that exist on-chain and can be traded from person to person or through secondary marketplaces. These tokens fuel several use-cases to the blockchain technology like DeFi and are important for retail adoption of the crypto-economy.

All tokens on Solana follow Solana's Token Program and Metaplex's Token Metadata Standard. Let us see how these programs play together to form different tokens on Solana.

Token Program

Solana's Token program is responsible for defining the common implementation for all tokens on Solana.

The spl-token-cli is a part of the Token program and allows the generation of tokens using command line:

spl-token create-token

You can follow the following reference guide to play around with the CLI but we won't go too deep into the specifics in this tutorial.

To fully understand how tokens work on Solana, its important to understand the account model of the Token program.

In the image above we can see 3 accounts: Wallet Account, Token Account and the Mint Account. The Wallet Account is owned by the native System Program and represents a user's wallet. The Token Account and the Mint Account are owned by the Token Program.

The Mint Account is responsible for storing global information regarding the Token, like current supply, Authority (the account address which has the authority to make changes to the given Account), token name, decimals, number of holders etc.
The Token Account is responsible for storing the relationship between the Wallet Account and the Mint Account, ie, it stores data like the number of tokens held by the wallet etc.

Associated Token Account Program

While discussing Token Accounts, its important to mention about the Associated Token Accounts. Why are these needed?

There can be more than 1 token accounts for a given token in a wallet. When a user sends a token from wallet A to wallet B, the user does not know which token account to specify as the destination token account. To solve this problem, associated token accounts are used to deterministically derive a token account address from a user's System Account (wallet), and the mint address.

While the Mint Account stores some global information regarding tokens, it is not capable of storing enough data to be able to be standardised and used globally by dapps and wallets. Here's where Metaplex's Token Metadata Program comes in.

Token Metadata Program

Its main goal is to attach additional data to all tokens on Solana. This allows to have a standardised way to read and observe tokens on wallets, dapps, and even block explorers which further allows the user experience to be uniform across the ecosystem.

The Metadata program uses PDAs to achieve this.

Program Derived Addresses (PDAs)

When we discussed Accounts, we talked about the need for the account's owner to sign transactions whenever a program makes a change to the said account.

This is problematic and not always possible. Imagine a use-case where we are counting the number of tokens in a given account and this number needs to be saved on-chain. The account's owner will have to sign transactions each time this number changes.

An ingenious solution to this is PDAs. PDAs envelop accounts that can be programmatically controlled by certain programs. This allows programs to sign on behalf of these accounts without a requirement of a private key, as shown below.

PDAs are deterministically derived from a program_id and a string (also known as seeds) like "auction_house" or "token_metadata". If you want to dive deep into how PDAs are generated, I highly recommend reading Solana Cookbook's PDA guide.

The Token Metadata Program attaches an account known as Metadata Account with the Mint Account using a PDA as shown below.

This account stores a lot of useful information that adds a whole new dimensionality to tokens on Solana, that the Token Program could not provide. This account also has a URI field which points to an off-chain JSON object. This object saves information like image, attributes for an NFT etc and helps even further enrich the token with data.

The combination of the Token Program and the Token Metadata Program allows tokens to be categorised into three broad categories depending on two characteristics:

supply: the total number of tokens in circulation.
decimal places: the number of decimals points a token is allowed to have. For example: SOL is allowed to have 9 decimal points, which means, 10^-9 SOL is a valid denomination and can be transferred between wallets.

SPL-tokens

These tokens are based on the Solana Program Library (SPL) and are supported by SOL the same way ERC-20 tokens are supported by ETH: the transaction fees for transferring these tokens from 1 account to another is paid in the native currency, SOL in the case of the Solana chain.

SPL-tokens and their associated metadata can be generated on the CLI using Metaboss

metaboss create fungible -d <decimals> -m <metadata_file>

What sets the SPL-tokens apart are their characteristic of having:

supply > 1
decimal places > 0

Fungible assets

Fungible assets are tokens that have:

supply > 1
decimal places = 0

Having a supply greater than 1 but decimal places equal to 0 creates tokens that have very unique use-cases: gaming assets or utilities like a shield or a piece of wood. These assets with decimals equal to 0 can't be divided or fractionalised but are non-singular in supply, thus allowing distribution to a number of users.

Non-Fungible tokens (NFTs)

These tokens are the ones which have gained a lot of traction in this past year. NFTs are are tokens that are non-replaceable / interchangeable. What this means is that there can not be more than 1 specimen of a given token. Every token is unique and can't be fractionalised. Here are their properties:

supply = 1
decimal places = 0

NFTs are usually represented by a visual asset: it could either be an image, video or gif. The asset could also be an mp3 file thus allowing the possibilities of music NFTs.

DEV Community

A dummy's guide to Solana's architecture

Introduction

The Block Explorer: a window to the chain

Blocks, Validators and Proof of History:

Accounts

Instructions and Transactions

Programs

SPL-tokens, NFTs and the Token Program

Token Program

Associated Token Account Program

Token Metadata Program

Program Derived Addresses (PDAs)

SPL-tokens

Fungible assets

Non-Fungible tokens (NFTs)

Oldest comments (0)

Read next

How to recover LTC sent by mistake to a BTC wallet

Computer Vision Meetup: Towards Resource Efficient Robust Text-to-Image Generative Models

Clae: Multipurpose Chat Bot

Decoding CSS: Mastering Cascade, Selectors, and Specificity for Smarter Styling.