The advantages of various types of strings in Rust

#rust #string

New Rustacians usually ask why Rust has many types of strings. I don't know the original intention. Anyways, it has some advantages as follow:

It minimizes the language core. Rust core has only str. String, OsString, CString are in the standard library, which is not a part of the language core.
Programmer can control performance. For example, &str refers to a byte slice so it should be fast, while String is a wrapper of Vec. String should have more performance penalty from heap memory allocation.
OsString and CString improve interoperability with other programming languages and operating systems. Rust can manipulate C-style strings directly without converting them to native Rust strings. OsString is also similar.

String conversation has a performance penalty. Also sometimes string conversation is not obvious. Some systems don't even use Unicode or other standards.

The alternative of having too many types is treating everything as a byte slice. However, type checking doesn't work well with too few types.

Top comments (3)

૮༼⚆︿⚆༽つ • Sep 16 '21

Most of the time you probably want CString instead of String for handling user input. Unlike String, CString automatically remove any 0 bytes ("nul characters").

I'm still looking on how to remove all non-printable chars in String (UTF-8)

Vee Satayamas • Sep 16 '21

Is a non-printable character a control character?

૮༼⚆︿⚆༽つ • Sep 17 '21 • Edited

I think what I mean is non-graphic character which consist of both Unicode Control Character (Cc, Cf, Cs, Co, Cn) and Separator Format Character (Zl, Zp)

ref: Unicode General Category

DEV Community

The advantages of various types of strings in Rust

Top comments (3)

Read next

Installing Rust on macOS with Homebrew

ALKANES: Smart Contracts on Bitcoin UTXOs

Mastering String Operations in JavaScript 🚀

Starting to Rust