-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Visualization tables has lack of descriptions #292
Comments
That's the index number as used by the spec.
These help with understanding what optimizations can be made in implementation. If the number of UTF-8 bytes is known, there's no need to branch to decide how many UTF-8 bytes are needed. "Follows immediately" is important for understanding contiguous ranges that could be arranged in data structures as such.
The number according to which the table is ordered is at the top and the other number is at the bottom. Hence, for the BMP coverage, the Unicode scalar value is at the top and for the index visualization the index number is at the top.
These are hexadecimal encoded byte values. |
I actually found answers for some my questions myself during understanding the "BMP coverage" pages (such as https://encoding.spec.whatwg.org/big5-bmp.html) contains a table 256x256 with following information:
The table represents "Index" pages contains a tables with slightly different structure depending on the encoding with following information:
Single-byte encodings (such as https://encoding.spec.whatwg.org/ibm866.html) contains a high half of the encoding (because they all are ASCII compatible and entries
Multi-byte encodings (such as https://encoding.spec.whatwg.org/big5.html) are more complicated. All such encodings (which are visualized) occupies 1 or 2 bytes per code point. In most cases only ASCII code points occupies 1 byte, so they are not included in visualization, other code points occupies two bytes:
Table dimensions depends on the encoding and represents constants that are used in encoding process. |
https://encoding.spec.whatwg.org/big5-bmp.html
https://encoding.spec.whatwg.org/big5.html
The text was updated successfully, but these errors were encountered: