I have my own part of the text, which I understand the regular expression to find the most common words. I am currently using .match(/(?!'.*')\b\[\w'\]+\b/g). My problem is that it \wdoes not match characters other than alphanumeric characters, and my emojis are never parsed. In particular, I am trying to create a regular expression that will identify words (including abbreviations) and emojis that separate word boundaries.
As an example, I would like to take "Hey there! 👋, let go to the moon 🌝🚀"and get
Array( "Hey", "there", "👋", "let's", "go", "to", "the", "moon", "🌝", "🚀")
source
share