Differences between revisions 6 and 35 (spanning 29 versions)
Revision 6 as of 2007-05-28 14:22:20
Size: 1449
Editor: dslb-084-058-237-229
Comment:
Revision 35 as of 2007-09-20 15:10:48
Size: 846
Editor: infno1613
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= About character encoding = [wiki:Self:completesearch/Installation Installation Guide]
Line 3: Line 3:
In general we use the multibyte character encoding UTF-8 as default encoding with the follwing consequences: [wiki:Self:CodingConventions Coding Conventions]
Line 5: Line 5:
- The $AC->settings->encoding is 'utf-8' unless overriden in autocomplete_config.php
- The text.php is saved as UTF-8
- The css file uses '@charset "utf-8";'
[wiki:Self:Templates Template pecularities in the CompleteSearch code]
Line 9: Line 7:
- $AC->settings->capitals is utf-8 encoded
- In ajax.php we utf-8 encode the query string if $AC->settings->encoding is utf-8 and the charset of content_type is not utf-8 (means the request is sent in a non-utf-8 type)
[wiki:Self:NewFeatures New Features that would be nice to have]
Line 12: Line 9:
We have to support other encodings like iso-8859-1 because some collections are not utf-8 encoded.
The default encoding can be overriden by $config->encoding in the autocomplete_config.php.
[wiki:Self:completesearch/Markus Markus' comments / log]
Line 15: Line 11:
To handle non-utf-8 encoding we do the following:
- the page encoding of index.php is determined by $AC->settings->encoding
( <meta http-equiv="content-type" content="text/html;charset=<?php echo $AC->settings->encoding; ?>">
)
- Texts from text.php are utf-8 decoded by $AC->get_text()
[wiki:Self:completesearch/CVSHistory CVS history]
Line 21: Line 13:
[wiki:Self:completesearch/TODO TODO list]
Line 22: Line 15:
[wiki:Self:completesearch/ExcerptGenerator Excerpt Generator requirements]
Line 23: Line 17:
== UTF-8 lowercase in PHP (23May07 Markus) == [wiki:Self:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]
Line 25: Line 19:
Requires extension mbstring (for functions like mb_strtolower). Following line required in php.ini [wiki:Self:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]
Line 27: Line 21:
{{{
In windows:
extension=php_mbstring.dll
[wiki:Self:completesearch/Examples Example programs etc.]
Line 31: Line 23:
or in linux:
extension=php_mbstring.so
}}}

(On geek, the mb_... functions were available by default, on Markus's laptop the line above had to be added.)


== Texts in text.php are now UTF-8 encoded (23May07 Markus) ==
[wiki:Self:completesearch/GNUBuildSystem How to use the autoconf/automake tools to build and deliver the project.]

[wiki:completesearch/Installation Installation Guide]

[wiki:CodingConventions Coding Conventions]

[wiki:Templates Template pecularities in the CompleteSearch code]

[wiki:NewFeatures New Features that would be nice to have]

[wiki:completesearch/Markus Markus' comments / log]

[wiki:completesearch/CVSHistory CVS history]

[wiki:completesearch/TODO TODO list]

[wiki:completesearch/ExcerptGenerator Excerpt Generator requirements]

[wiki:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]

[wiki:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]

[wiki:completesearch/Examples Example programs etc.]

[wiki:completesearch/GNUBuildSystem How to use the autoconf/automake tools to build and deliver the project.]

CompleteSearch: FrontPage (last edited 2017-03-19 13:30:19 by Hannah Bast)