A ColdFusion application that we developed a couple years ago worked with Twilio to log incoming text messages. The UTF-8 message payloads were saved in a MSSQL database using the NVARCHAR datatype and could be displayed on webpages without any issue. When importing a CSV file into a third-party Windows program, a random error would cause the import to abort whenever it encountered a high ASCII character. We didn't want to strip out the data, but we also didn't want to convert emojis to HTML entities or decimal values as they would be somewhat meaningless outside of an HTML environment.
That's when we discovered the email-java java library. It bills itself as the "The missing emoji library for Java" and really does add many emoji-specific features that aren't natively available in Java (or ColdFusion).
Here's a cf-email-java wrapper to help identify, sanitize and convert emojis in CFML projects. (NOTE: This my first time I creating a project in Github versus just creating a simple gist.)
For the CSV export, we used the parseToAliases
method on the strings and messages like I like š
were converted to I like :pizza:
.
Here's some examples of functionality.
emojijava.isEmoji('ā¤ļø'); // true
emojijava.isEmoji('I ā¤ļø š'); // false
emojijava.containsEmoji('I ā¤ļø š'); // true
emojijava.isOnlyEmojis('I ā¤ļø š'); // false
emojijava.isOnlyEmojis('š ā¤ļø š'); // true
emojijava.parseToAliases('I like š'); // I like :pizza:
emojijava.parseToHtmlDecimal('I ā¤ļø š'); // I ❤ļø 🍕
emojijava.parseToHtmlHexadecimal('I ā¤ļø š'); // I ❤ļø 🍕
emojijava.removeAllEmojis('I ā¤ļø š'); // I
emojijava.removeAllEmojisExcept('I ā¤ļø š', "pizza"); // I š
emojijava.removeEmojis(text, "pizza"); // I ā¤ļø
emojijava.removeEmojis('I ā¤ļø š', "[emoji]"); // I [emoji] [emoji]
emojijava.extractEmojis('I ā¤ļø š'); // I ["ā¤ļø", "š"]
emojijava.extractEmojis('I ā¤ļø š', true); // an array of structs w/emoji data
Top comments (0)