| advertise add site services publishers database health videos | ![]() | about toolbar stats live show health store more stuff JOIN/LOGIN |
ORDER ONLINE: Green Coffee Products by Ed Hardy Green Energy edhardygreenenergy.com | Mens Compression Socks - Medical Compression Socks - Athletic... allegromedical.com | Compression Clamp,Chamley Compression Clamp,External Fixation... indianorthopaedic.com | Graduated Compression Sock | Compression Hose | Compression Socks for... phc-online.com |
"BOCU" redirects here. For other uses, see BOCU (disambiguation). BOCU-1 is a MIME compatible Unicode compression scheme. BOCU stands for Binary Ordered Compression for Unicode. BOCU-1 combines the wide applicability of UTF-8 with the compactness of SCSU. This Unicode encoding is designed to be useful for compressing short strings, and maintains code point order. BOCU-1 is specified in an Unicode Technical Note.[1] For comparison SCSU was adopted as standard Unicode compression scheme with a byte/code point ratio similar to language-specific code pages. SCSU has not been widely adopted, as it is not suitable for MIME “text” media types. For example, SCSU cannot be used directly in emails and similar protocols. SCSU requires a complicated encoder design for good performance. Usually, the zip, bzip2, and other industry standard algorithms compact larger amounts of Unicode text more efficiently.[2] Both SCSU[3] and BOCU-1[4] are IANA registered charsets.
[edit] DetailsAll numbers in this section are hexadecimal, and all ranges are inclusive. Code points from
The difference between the current code point and the normalized previous code point is encoded as follows:
Each byte range is lexicographically ordered with the following thirteen byte values excluded: Any ASCII input BOCU-1 offers a similar robustness also for input texts without the above mentioned values with the special reset code The optional use of a signature In theory UTF-1 and UTF-8 could encode the original UCS-4 set with 31 bits up to [edit] PatentThe general BOCU algorithm is covered by United States Patent #6,737,994, which also mentions the specific BOCU-1 implementation.[5] IBM, which employed both of the inventors of BOCU-1 at the time it was created, states in the Unicode Technical Note that implementers of a "fully compliant version of BOCU-1" must contact IBM to request a royalty-free license.[6] BOCU-1 is the only Unicode compression scheme described on the Unicode Web site that is known to be encumbered with intellectual property restrictions. By contrast, IBM also filed for a patent on UTF-EBCDIC, but it chose in that case to make the documentation and encoding scheme “freely available to anyone concerned towards making the transformation format as part of the UCS standards,” instead of requiring implementers to request a license.[7] [edit] References
[edit] See also
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ↑ top of page ↑ | about thumbshots |