Differences between revisions 6 and 190 (spanning 184 versions)
Revision 6 as of 2007-05-28 14:22:20
Size: 1449
Editor: dslb-084-058-237-229
Comment:
Revision 190 as of 2017-03-19 13:30:19
Size: 204
Editor: Hannah Bast
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= About character encoding = #acl All:read
Line 3: Line 3:
In general we use the multibyte character encoding UTF-8 as default encoding with the follwing consequences: = CompleteSearch =
Line 5: Line 5:
- The $AC->settings->encoding is 'utf-8' unless overriden in autocomplete_config.php
- The text.php is saved as UTF-8
- The css file uses '@charset "utf-8";'
[[QuickIntro|A quick intro to CompleteSearch (how to install + basic principles)]]
Line 9: Line 7:
- $AC->settings->capitals is utf-8 encoded
- In ajax.php we utf-8 encode the query string if $AC->settings->encoding is utf-8 and the charset of content_type is not utf-8 (means the request is sent in a non-utf-8 type)
[[CsvParser|The generic CsvParser]]
Line 12: Line 9:
We have to support other encodings like iso-8859-1 because some collections are not utf-8 encoded.
The default encoding can be overriden by $config->encoding in the autocomplete_config.php.

To handle non-utf-8 encoding we do the following:
- the page encoding of index.php is determined by $AC->settings->encoding
( <meta http-equiv="content-type" content="text/html;charset=<?php echo $AC->settings->encoding; ?>">
)
- Texts from text.php are utf-8 decoded by $AC->get_text()



== UTF-8 lowercase in PHP (23May07 Markus) ==

Requires extension mbstring (for functions like mb_strtolower). Following line required in php.ini

{{{
In windows:
extension=php_mbstring.dll

or in linux:
extension=php_mbstring.so
}}}

(On geek, the mb_... functions were available by default, on Markus's laptop the line above had to be added.)


== Texts in text.php are now UTF-8 encoded (23May07 Markus) ==
[[Details|Details (word id map, ...)]]

CompleteSearch: FrontPage (last edited 2017-03-19 13:30:19 by Hannah Bast)