Text units for letter-spacing are incorrect #92

r12a · 2020-02-05T05:54:20Z

For various reasons wherever a word needs to be broken in constituent characters in case of Latin script, Indian language words can and should be broken based on Akshara as given here

As the W3C specification points to Unicode Text Segmentation (TR 29), it is observed that some of the browsers support it (e.g. Chrome and Firefox) whereas Microsoft Edge and Interner Explorer seems to break the words in individual characters.

It has been marked as basic as the Unicode Text Segmentation rules themselves need to be matured enough to cater to nuances of many languages that get written using Devanagari script. Some of the languages like Santali, require some special Nukta rules.

Also, in cases where there is wrong Akshara formation e.g. Consonant+Matra+Matra, the breaking seems to stack ill formed akshara into one set instead of clearly breaking it separate. This breaking behaviour needs to improve.

r12a · 2020-02-05T05:54:25Z

The first comment in this issue contains text that will automatically appear in the Devanagari gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

r12a added i:spacing Text spacing gap p:basic doc:deva labels Feb 5, 2020

r12a added the x:deva label May 18, 2021

r12a added the l:hi Hindi, Devanagari script label May 1, 2024

r12a added this to Gap-analysis pipeline Jun 20, 2024

r12a moved this to Issue identified, needing investigation in Gap-analysis pipeline Jun 20, 2024

r12a added the s:deva Devanagari script label Jul 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Text units for letter-spacing are incorrect #92

Text units for letter-spacing are incorrect #92

r12a commented Feb 5, 2020

r12a commented Feb 5, 2020

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!

Text units for letter-spacing are incorrect #92

Text units for letter-spacing are incorrect #92

Comments

r12a commented Feb 5, 2020

r12a commented Feb 5, 2020

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!