Marius Muntean

Posted on Nov 13, 2022

Redis as a Database with Redis OM

#redis #database #graphql #aspnet

My newest interest is Redis, an in-memory data store for key-value pairs, streaming, message brokering and a database.

In this post I’ll give you an introduction into using Redis as a normal database. The code snippets will use C# and the final result will look very familiar to anyone who used Entity Framework.

What you’re not getting is a spoon-feeding, instead you’re going to see the relevant parts of Visualizer, a pet project of mine, where I’m using Redis to ingest and query tweets from Twitter’s sample steam.

Tools

Setting up all the tools is beyond the scope of this post, but I still want to mention the most essential things.

Redis Stack: a suite of multiple components, that includes a Redis instance, multiple Redis modules and RedisInsight. Two modules, RediSearch and RedisJSON are essential here, as they allow us to use Redis as a proper DB. The simplest way to get going is to use the official Docker container.

Redis OM .NET: an official .NET library that offers (CLR) object mapping and query translation. I can recommend it wholeheartedly, even though it isn’t perfect. The excellent community and maintainers go above and beyond the call of duty. Ask them anything on the official Discord server.

A good .NET IDE: like VS Code, Visual Studio Proper or Rider.

And finally, a few spare minutes.

Data Modeling

In Visualizer, I’m receiving a stream of tweets, including their many properties and nested data, which represent 1% of the “real-time” tweets.

The fields are described in the official Twitter API documentation, look for Response Fields.

To store and retrieve the data, I’m using Redis OM, which wants a normal C# class whose properties are annotated.

The data model for a single tweet looks like this

	[Document(StorageType = StorageType.Json, Prefixes = new[] { nameof(TweetModel) })]
	public class TweetModel
	{
	[RedisIdField]
	[Indexed]
	public string? InternalId { get; set; }

	[Indexed]
	public string Id { get; set; }

	/// <summary>
	/// The Tweet ID of the original Tweet of the conversation (which includes direct replies, replies of replies).
	/// </summary>
	[Indexed]
	public string ConversationId { get; set; }

	/// <summary>
	/// The actual UTF-8 text of the Tweet.
	/// </summary>
	[Searchable(PhoneticMatcher = "dm:en")] // src https://github.com/redis/redis-om-node/blob/main/README.md
	public string Text { get; set; }

	[Indexed]
	public string AuthorId { get; set; }

	[Indexed]
	public string Username { get; set; }

	/// <summary>
	/// UTC ticks when the Tweet was created.
	/// </summary>
	/// <value></value>
	[Indexed(Sortable = true, Aggregatable = true)]
	public long CreatedAt { get; set; }

	/// <summary>
	/// Contains details about the location tagged by the user in this Tweet, if they specified one.
	/// </summary>
	[Indexed(Aggregatable = true)]
	public GeoLoc? GeoLoc { get; set; }

	// Truncated for brevity - original https://github.com/mariusmuntean/Visualizer/blob/a76d42973ed55a358af06f71fc786be1b3a8a88e/Visualizer.Shared.Models/TweetModel.cs

	[Indexed]
	public int PublicMetricsLikeCount { get; set; }

	[Indexed]
	public int PublicMetricsRetweetCount { get; set; }

	/// <summary>
	/// A list of Tweets this Tweet refers to. For example, if the parent Tweet is a Retweet, a Retweet with comment (also known as Quoted Tweet) or a Reply, it will include the related Tweet referenced to by its parent.
	/// </summary>
	[Indexed(CascadeDepth = 1)]
	public ReferencedTweet[] ReferencedTweets { get; set; }
	}

view raw Visualizer - TweetModel.cs hosted with ❤ by GitHub

You can find the full file here.

Notice that the class is annotated as a Document *with *StorageType JSON. This lets Redis OM know that we want to store instances of this class in a searchable way and in JSON format. The Prefixes is just for our convenience, to make it easier to distinguish the data in Redis, in case that we have multiple document types.

Each property is also annotated, to instruct Redis OM (and indirectly RediSearch and RedisJSON) which fields should be searchable and how exactly.

The attribute RedisIdField is manadatory and you can interpret it as the primary key of a tweet.
Indexed does exactly what you’d expect.
Searchable is powerful, because it allows full text search in the string property.

CascadeDepth tells Redis OM how deep it should look for annotated properties. My TweetModel class has an array of ReferencedTweet instance. The ReferencedTweet class is not marked as a Document because it is nested inside the TweetModel. It will however contain its own annotated properties. That’s why I set the depth to 1.

All attributes are described here.

Create the Index

After the data is modelled, a so-called index needs to be created in Redis.

First, create a RedisConnectionProvider. In my Visualizer project I’m adding one to the DI container with an extension method

	public static void AddRedisOMConnectionProvider(this WebApplicationBuilder webApplicationBuilder)
	{
	var host = webApplicationBuilder.Configuration.GetSection("Redis")["Host"];
	var port = webApplicationBuilder.Configuration.GetSection("Redis")["Port"];
	var redisConnectionConfiguration = new RedisConnectionConfiguration
	{
	Host = host,
	Port = Convert.ToInt32(port)
	};
	var redisConnectionProvider = new RedisConnectionProvider(redisConnectionConfiguration);
	webApplicationBuilder.Services.AddSingleton(redisConnectionProvider);
	}

view raw RedisOMConfig.cs hosted with ❤ by GitHub

Then, let Redis OM send the command that instructs RediSearch/RedisJSON which field is searchable and how. I’m doing it when one of my microservices starts up.

	public class RedisOMIndexInitializer : IHostedService
	{
	private readonly RedisConnectionProvider _redisConnectionProvider;

	public RedisOMIndexInitializer(RedisConnectionProvider redisConnectionProvider)
	{
	_redisConnectionProvider = redisConnectionProvider;
	}

	public async Task StartAsync(CancellationToken cancellationToken)
	{
	await _redisConnectionProvider.Connection.DropIndexAsync(typeof(TweetModel)).ConfigureAwait(false);
	await _redisConnectionProvider.Connection.CreateIndexAsync(typeof(TweetModel)).ConfigureAwait(false);
	}

	public Task StopAsync(CancellationToken cancellationToken)
	{
	return Task.CompletedTask;
	}
	}

view raw IndexInitialiser.cs hosted with ❤ by GitHub

In Redis, the index will have the name tweetmodel-idx. You can list them all by issuing FT._LIST in the Redis CLI or get a detailed explanation of an index with FT.INFO tweetmodel-idx.

Data Storing

After you have a few tweets (I’m using a library called Tweetinvi) you want to store them.

Redis OM makes this easy by offering us a RedisCollection, where T is our data model, e.g. TweetModel. If it helps, this is analogous to a DbSet in Entity Framework,

To get a hold of such a collection, you make use of the RedisConnectionProvider that you registered previously

var tweetCollection = _redisConnectionProvider.RedisCollection<TweetModel>();

view raw TweetCollection.cs hosted with ❤ by GitHub

On it you call InsertAsync() *and provide your *TweetModel instance

var internalId = await _tweetCollection.InsertAsync(tweetModel);

view raw TweetDbService - Store a Tweet.cs hosted with ❤ by GitHub

That’s it, your tweet is now stored in Redis. You can now have a look at your data with RedisInsight. This is how it looks for me

Filter, Sort and Paginate Tweets

I usually work with relational DBs (mostly Postgres), where filtering, sorting and paginating results is as common as it gets.

In Redis, especially using Redis OM, this is as easy as we’re used to as Entity Framework users.

First, we need a DTO that defines all the necessary filtering, sorting and pagination fields.

	public class FindTweetsInputDto
	{
	public string TweetId { get; set; }
	public string AuthorId { get; set; }
	public string Username { get; set; }
	public string SearchTerm { get; set; }
	public string[] Hashtags { set; get; }

	public bool? OnlyWithGeo { get; set; }
	public GeoFilter? GeoFilter { get; set; }

	public int? PageSize { get; set; }
	public int? PageNumber { get; set; }

	public DateTime? StartingFrom { get; set; }
	public DateTime? UpTo { get; set; }

	public SortField? SortField { get; set; }
	public SortOrder? SortOrder { get; set; }
	}

	public record GeoFilter(double Latitude, double Longitude, double RadiusKm);

	public enum SortField
	{
	CreatedAt,
	Username,
	PublicMetricsLikesCount,
	PublicMetricsRetweetsCount,
	PublicMetricsRepliesCount
	}

	public enum SortOrder
	{
	Ascending,
	Descending
	}

	public record TweetModelsPage(int Total, List<TweetModel> Tweets);

view raw FindTweetsInputDto.cs hosted with ❤ by GitHub

Visualizer offers a GraphQL API, hence the “Input” in the DTOs name.

Next, we have to formulate a query based on the DTO and retrieve the tweets that match that query.

First, get a new RedisCollection instance like you saw in the previous section

var tweetCollection = _redisConnectionProvider.RedisCollection<TweetModel>();

view raw TweetCollection.cs hosted with ❤ by GitHub

Then you can start filtering, e.g. by a tweet’s ID

	if (!string.IsNullOrWhiteSpace(inputDto.TweetId))
	{
	tweetCollection = tweetCollection.Where(t => t.Id == inputDto.TweetId);
	}

view raw TweetDbQueryService - Filter By Tweet ID.cs hosted with ❤ by GitHub

Remember that TweetModel.Text was annotated as Searchable, which allows for full text searches. To filter only tweets that contain a certain word in their text you’d write something like this

	if (!string.IsNullOrWhiteSpace(inputDto.SearchTerm))
	{
	tweetCollection = tweetCollection.Where(t => t.Text == inputDto.SearchTerm);
	}

view raw TweetDbQueryService - Full Text Search.cs hosted with ❤ by GitHub

Note that there’s no call to Contains() or anything like that, just a simple ==.

The TweetModel.CreatedAt property is Indexed and Searchable

	/// <summary>
	/// UTC ticks when the Tweet was created.
	/// </summary>
	/// <value></value>
	[Indexed(Sortable = true, Aggregatable = true)]
	public long CreatedAt { get; set; }

view raw TweetModel - CreatedAt.cs hosted with ❤ by GitHub

Tweets can be filtered by date like so

	if (inputDto.StartingFrom is not null)
	{
	var startingFromTicks = inputDto.StartingFrom.Value.ToUniversalTime().Ticks;
	tweetCollection = tweetCollection.Where(t => t.CreatedAt >= startingFromTicks);
	}

view raw TweetDbQueryService - StartingFrom.cs hosted with ❤ by GitHub

A cool feature of Redis is its native support for the geolocation data type. Tweets can include geolocation information and to filter the tweets that are at a certain distance from a location you’d write something like this

	if (inputDto.GeoFilter is not null)
	{
	tweetCollection = tweetCollection.GeoFilter(model => model.GeoLoc, inputDto.GeoFilter.Longitude, inputDto.GeoFilter.Latitude, inputDto.GeoFilter.RadiusKm, GeoLocDistanceUnit.Kilometers);
	}

view raw TweetDbQueryService - GeoFilter.cs hosted with ❤ by GitHub

Sorting and pagination are nothing special

	// Get the total count of the filtered tweets.
	var count = await tweetCollection.CountAsync().ConfigureAwait(false);

	// Sort tweets.
	var sortField = inputDto.SortField ?? SortField.CreatedAt;
	var orderByDirection = inputDto.SortOrder ?? SortOrder.Descending;
	tweetCollection = (orderByDirection, sortField) switch
	{
	(SortOrder.Ascending, SortField.Username) => tweetCollection.OrderBy(model => model.Username),
	(SortOrder.Ascending, SortField.CreatedAt) => tweetCollection.OrderBy(model => model.CreatedAt),
	(SortOrder.Ascending, SortField.PublicMetricsLikesCount) => tweetCollection.OrderBy(model => model.PublicMetricsLikeCount),
	(SortOrder.Ascending, SortField.PublicMetricsRepliesCount) => tweetCollection.OrderBy(model => model.PublicMetricsReplyCount),
	(SortOrder.Ascending, SortField.PublicMetricsRetweetsCount) => tweetCollection.OrderBy(model => model.PublicMetricsRetweetCount),
	(SortOrder.Descending, SortField.Username) => tweetCollection.OrderByDescending(model => model.Username),
	(SortOrder.Descending, SortField.CreatedAt) => tweetCollection.OrderByDescending(model => model.CreatedAt),
	(SortOrder.Descending, SortField.PublicMetricsLikesCount) => tweetCollection.OrderByDescending(model => model.PublicMetricsLikeCount),
	(SortOrder.Descending, SortField.PublicMetricsRepliesCount) => tweetCollection.OrderByDescending(model => model.PublicMetricsReplyCount),
	(SortOrder.Descending, SortField.PublicMetricsRetweetsCount) => tweetCollection.OrderByDescending(model => model.PublicMetricsRetweetCount),
	_ => tweetCollection
	};

	// Paginate tweets.
	var pageSize = inputDto.PageSize ?? 10;
	var pageNumber = inputDto.PageNumber ?? 0;
	var skipAmount = pageSize * pageNumber > count ? 0 : pageSize * pageNumber;
	tweetCollection = tweetCollection.Skip(skipAmount).Take(pageSize);

view raw TweetDbQueryService - Sorting and pagination.cs hosted with ❤ by GitHub

Finally, you’d want to materialize the query and get your tweets

	// Produce tweets.
	var tweetsIlist = await tweetCollection.ToListAsync();
	var tweetModels = tweetsIlist.ToList();
	return new TweetModelsPage(count, tweetModels);

view raw TweetDbQueryService - Produce Results.cs hosted with ❤ by GitHub

Visualizer has a simple React frontend that allows to query for tweets

And even to filter them by location

There's More to Come

I’m planning to write a few more posts about Visualizer, where I show Redis’ PubSub, quick ranking with SortedSets and GraphQL subscriptions.

If you liked this post let me know on Twitter 😉 (@MunteanMarius), give it a ❤️ and follow me to get more content on Redis, Azure, MAUI and other cool stuff.

DEV Community

Redis as a Database with Redis OM

Tools

Data Modeling

Create the Index

Data Storing

Filter, Sort and Paginate Tweets

There's More to Come

Other content on this topic

Top comments (0)

Read next

Spring Boot + Redis: A Beginner-Friendly Guide to Supercharging Your App’s Performance

Exploring the mongo Shell: A Command-Line Interface for MongoDB

Understanding ACID Properties: The Pillars of Reliable Databases

Three Tier Architecture. Overview and Each tier explained.