Linux
Welcome to c/linux!
Welcome to our thriving Linux community! Whether you're a seasoned Linux enthusiast or just starting your journey, we're excited to have you here. Explore, learn, and collaborate with like-minded individuals who share a passion for open-source software and the endless possibilities it offers. Together, let's dive into the world of Linux and embrace the power of freedom, customization, and innovation. Enjoy your stay and feel free to join the vibrant discussions that await you!
Rules:
-
Stay on topic: Posts and discussions should be related to Linux, open source software, and related technologies.
-
Be respectful: Treat fellow community members with respect and courtesy.
-
Quality over quantity: Share informative and thought-provoking content.
-
No spam or self-promotion: Avoid excessive self-promotion or spamming.
-
No NSFW adult content
-
Follow general lemmy guidelines.
view the rest of the comments
jakeJakej4keJak3j@k3JAK€jπ⸦kE𝚥ᎪᏦ⋲ꓙᏎ🅺Ꮛ𞋕ꮜ𝈲𝈁᜴ᚣᜩᗕIt is not this, but same problem scope. Resolve all to "jake" for further processing. Also specifically looking for that ck.
trcould definitely do some work here. maybe something like, echo each word and its translated counterpart, sort on first column, and thenecho $col1 >> $col0for each line? it's a start at leastIf you want
᜴ᚣᜩᗕto get resolved tojakethen bash will be a pain to use. I would use pythonFor each ASCII letter create a list of non-ASCII characters that look similar. Then, for each word you want to match construct a regex
dictionary = { ...'j': ['j', 'J', '𝚥', '𞋕', ...[jJ𝚥𞋕][aA4@][kKᏦ🅺][eE3€]In general the group of problems that you are touching here is https://en.wikipedia.org/wiki/String_metric but I'm not sure if there is an algorithm that would be so "visual" matching
Thanks. This was helpful.