this post was submitted on 28 Jun 2025
75 points (94.1% liked)

Technology

72015 readers
2815 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

cross-posted from: https://rss.ponder.cat/post/217784

Signposts on the Vancouver street bear the English name below the official Musqueam name, which is written in the North American Phonetic Alphabet.


From this RSS feed

top 21 comments
sorted by: hot top controversial new old
[–] lka1988@lemmy.dbzer0.com 3 points 5 hours ago

Signposts on the Vancouver street bear the English name below the official Musqueam name, which is written in the North American Phonetic Alphabet.

I don't see the issue here.

[–] sp3ctr4l@lemmy.dbzer0.com 2 points 9 hours ago* (last edited 9 hours ago)

So... what Seattle does in the International District is this:

Street names are written in both English, and... Mandarin, Japanese, Vietnamese.

Sometimes its two different signs, sometimes its one big sign.

I... don't really see how just putting a sign that writes it out in both languages... is not a reasonable solution to this scenario?

But that doesn't really even appear to be the main issue going on here.

...

As a person who has maintained large databases... yeah, it is... possible to implement support for nonstandard characters... but you would have to very, very directly legally require this specific level of support, and probably have a lot of lead time.

Not saying its morally right or justifiable, but the industry standard is almost always to just support the very basic, bare minimum of charsets... and then everything is built on top of that assumption, and the actual core setup configs at that level haven't been touched in 20+ years, and probably there's only 2 people at the entire org that even know... that such things exist, or what actual physical server they are running on.

It can be a bitch and a half to fundamentally rework an entire database system to support and uncommon character set, usually security minded practices will have you scrubbing out or charswapping or banning anything that isn't standard... and if there is any link in the chain, at any point, that doesn't properly support your new charset, well, it all blows up.

So... that is everything from front end to backend that has to come up with a solution, and the reality is, for just most of such modern software systems... that means you're going through god knows how many vendors and liscensed software, and now they all have to be compliant as well, in more or less exactly the same way.

Anyway, NAPA does exist in at least comprehensive character standards:

https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry

Type: variant

Subtag: fonnapa

Description: North American Phonetic Alphabet

Description: Americanist Phonetic Notation

Added: 2016-06-24

So it should be theoretically possible.

...

Another... weird aspect to this is... NAPA is not like the actual written characters that the Musqueam, or any other Peoples of the Salish Sea... actually ever used, historically.

It is a modern, academic alphabet, similar to IPA, primarily developed out of trying to basically reverse engineer almost entirely oral, spoken languages.

It is not something any of them ever historically used as a written character set, outside of modern academia and modern attempts to revive various languages of various peoples, to encourage their use and prevent the languages from going extinct.

Similar to NAPA is Saanich, or SENĆOŦEN, or Sənčáθən.

https://en.m.wikipedia.org/wiki/Saanich_dialect

This writing system is used by the some of the Saanich peoples, academics and modern language revival movements... very roughly speaking, the Saanich peoples/languages are a subset of the broader group of Salish peoples/languages

...

Another problem with this realm is that... there is no 100% respected as an authority standard on how exactly to use or implement exactly which characters in NAPA to represent exactly which sounds... so... different specific Peoples, Tribes, Academics, etc, may be using different characters within NAPA for the same sound.

Its all very confusing from the standpoint of a database / data entry software dev trying to figure out how to actually implement this.

[–] Maxx0r@lemmy.world 46 points 22 hours ago (2 children)

I found this interesting since it's basically about unicode support, so I did some research. This article seems bullshit, it seems like rhe residents are expected to use the english version of the streetname (Musqueamview) and not the indigenous one with uncommon letters. So, the problems described is totally fabricated. But also: databases shouldn't forbid uncommon unicode letters if it isn't called for.

[–] Treczoks@lemmy.world 13 points 17 hours ago

But also: databases shouldn't forbid uncommon unicode letters if it isn't called for.

I stumble across this issue quite often. When you fill out a form for US customs, you are both required to provide exact data and you are only allowed a-z, 0-9, and some punctuation. That you cannot fulfil both because they are mutually exclusive does not cross their blessed little minds.

[–] palordrolap@fedia.io 21 points 20 hours ago (3 children)

Updating databases to support anything other than that which would run on a 1970s mainframe costs the sort of money that eats into C-level's yacht funds, so it won't happen. These are the people who when faced with the "pick two from done right, done quick and done cheap" will never pick the first one.

Or in other words, if your name contains something outside the English alphabet's A-Z, you're out of luck. They'll give you an approximation you don't want and you'll like it. Lower case? What's that? You're Irish and your surname has an apostrophe? F**k you, that's in the bin, you're OBRIEN now.

I was about to suggest SHXWMATHKWAYAMASAM as something that would be bound to work, but it's 18 characters, and, being two more than a power of two, that all but guarantees that someone will truncate it at 16. Sigh.

[–] A_norny_mousse@feddit.org 5 points 20 hours ago (1 children)

Updating databases to support anything other than that which would run on a 1970s mainframe costs the sort of money that eats into C-level’s yacht funds, so it won’t happen.

Even so, multiple strategies to include unicode characters in ASCII exist. *sigh*

[–] deegeese@sopuli.xyz 1 points 13 hours ago* (last edited 13 hours ago)

Putting it in a DB is the easy part.

It’s support in a thousand other systems that deal with addresses that’s the real problem.

For something like a street address, interoperability is a hell of a lot more important than culturally preferred spelling.

[–] Treczoks@lemmy.world 0 points 17 hours ago

More than three syllables, too complicated for the average American.

[–] RickyRigatoni@retrolemmy.com 3 points 16 hours ago

w isn't even a real letter

[–] ExLisper@lemmy.curiana.net 6 points 22 hours ago

Nice trolling, dude.

[–] Ilovethebomb@sh.itjust.works 7 points 23 hours ago (2 children)

šxʷməθkʷəy̓əmasəm Street (pronounced sh-MUS-quee-um-AW-

[–] user224@lemmy.sdf.org 5 points 21 hours ago

My closest attempt at pronouncing that:

echo 'šxʷməθkʷəy̓əmasəm' | aplay -c 1 -f u8 -r 2000 -t raw -
[–] funkless_eck@sh.itjust.works 1 points 17 hours ago (1 children)

as written it should be pronounced

shw-meth-quey-em-ass-em

where the shw is throaty - is it different because of the language?

[–] lime@feddit.nu 2 points 15 hours ago (1 children)

are you thinking because of the use of IPA characters? because those have defined sounds only within the IPA.

[–] funkless_eck@sh.itjust.works 2 points 14 hours ago (1 children)

Is this not IPA? I thought the article said it was

[–] lime@feddit.nu 3 points 14 hours ago (1 children)

it's the alphabet the BC first nations use. it uses similar characters but not with ipa pronunciations.

[–] funkless_eck@sh.itjust.works 2 points 14 hours ago

TIL - thanks

[–] Uff@lemmy.world 0 points 18 hours ago (1 children)

Yet another anglosaxon government suppressing non anglosaxon culture

[–] TheGrandNagus@lemmy.world 0 points 11 hours ago (1 children)

Anglo-Saxon? Lmao what a bizarre way to talk. Not that it even makes sense considering Anglo-Saxon settlers in the US weren't even close to being the majority.

It's down to unicode support. Not a plot from Anglo-Saxons to keep others in their place lol

[–] grue@lemmy.world 2 points 10 hours ago

To be fair, not considering unicode support a priority is a pretty damn English-language-centric attitude.