Propose an extension to fields to include language and transliteration status

Comments (4)

Joseph Heenan

As I mentioned on today’s call, OpenID Connect Core already defines the ability to have claims in multiple languages:

‌

https://openid.net/specs/openid-connect-core-1_0.html#ClaimsLanguagesAndScripts

It’s not immediately obvious to me how you would use the syntax to indicate whether a claim had been transliterated or not.

2020-10-14T16:56:30+00:00

Julian White reporter

Yes, I think what I would get is the language used by the OP rather than whether its been transliterated or not.

‌

I think what I want to know is:

the name as recognised/used by the OP (mandatory - obvs)
the language the OP is using for that name (mandatory where the language used for the claim is not the same as the one expected from the OP)
whether the OP has transliterated that name, and if so how; options here could be:
1. original - name is the same as in the natural language name
2. manual - manual transliteration by someone, could be a bit random in how they do it
3. automated_9303 - computed transliteration following ICAO 9303 rules (probably the most common)
the natural language version of the name (optional)
the natural language of the name (optional)

This is most interesting when you are taking things from electronic travel documents as they can store both the transliterated and natural language version of the names via data group 1 and 11 of the chip.

‌

Using my previous example, if I had a German national in the UK that was trying to use eKYC-IDA API to do something on a German system its likely that the UK OP would return MOELLER to the request with a language of en-GB. The German system can’t tell from that whether it should be using MOELLER, MÖLLER or MØLLER on their side because they don’t know whether that was transliterated or not. If it wasn’t transliterated then they know its MOELLER, if it is then it could be MÖLLER or MØLLER. If by chance the UK system had the original data from the passport chip then it could also send the natural language data even if within the OP that isn’t the name they are hanging their records off, so the German system would receive the original MÖLLER or MØLLER as well MOELLER.

2020-10-20T23:49:03+00:00

Mark Haine

removed component

2020-10-21T15:35:40+00:00

Nat Sakimura

In some languages and cultures, transliteration is not readily possible and the option in such a case is that one registers preferred ASCII representation of their names. In such cases, it may be useful to have information about what representation it is, e.g., Passport representation, etc.

Phonetic transcription also does not work in many cases. FYI, Japanese names have no official phonetics representation but just the characters.

2021-10-06T16:09:38+00:00

Joseph Heenan
As I mentioned on today’s call, OpenID Connect Core already defines the ability to have claims in multiple languages:

‌

https://openid.net/specs/openid-connect-core-1_0.html#ClaimsLanguagesAndScripts

It’s not immediately obvious to me how you would use the syntax to indicate whether a claim had been transliterated or not.
- 2020-10-14T16:56:30+00:00
Julian White reporter
Yes, I think what I would get is the language used by the OP rather than whether its been transliterated or not.

‌

I think what I want to know is:
1. the name as recognised/used by the OP (mandatory - obvs)
2. the language the OP is using for that name (mandatory where the language used for the claim is not the same as the one expected from the OP)
3. whether the OP has transliterated that name, and if so how; options here could be:
  1. original - name is the same as in the natural language name
  2. manual - manual transliteration by someone, could be a bit random in how they do it
  3. automated_9303 - computed transliteration following ICAO 9303 rules (probably the most common)
4. the natural language version of the name (optional)
5. the natural language of the name (optional)
This is most interesting when you are taking things from electronic travel documents as they can store both the transliterated and natural language version of the names via data group 1 and 11 of the chip.

‌

Using my previous example, if I had a German national in the UK that was trying to use eKYC-IDA API to do something on a German system its likely that the UK OP would return MOELLER to the request with a language of en-GB. The German system can’t tell from that whether it should be using MOELLER, MÖLLER or MØLLER on their side because they don’t know whether that was transliterated or not. If it wasn’t transliterated then they know its MOELLER, if it is then it could be MÖLLER or MØLLER. If by chance the UK system had the original data from the passport chip then it could also send the natural language data even if within the OP that isn’t the name they are hanging their records off, so the German system would receive the original MÖLLER or MØLLER as well MOELLER.
- 2020-10-20T23:49:03+00:00
Mark Haine
- removed component
- 2020-10-21T15:35:40+00:00
Nat Sakimura
In some languages and cultures, transliteration is not readily possible and the option in such a case is that one registers preferred ASCII representation of their names. In such cases, it may be useful to have information about what representation it is, e.g., Passport representation, etc.

Phonetic transcription also does not work in many cases. FYI, Japanese names have no official phonetics representation but just the characters.
- 2021-10-06T16:09:38+00:00
Log in to comment