Line 1: | Line 1: |
= About character encoding = | == Documentation (users) == |
Line 3: | Line 3: |
In general, we use the multibyte character encoding UTF-8 as the default encoding, with the following consequences: | [wiki:Self:completesearch/Installation Installation Guide] |
Line 5: | Line 5: |
* $AC->settings->encoding is 'utf-8' unless overridden in autocomplete_config.php * text.php is saved as UTF-8 * The CSS file declares '@charset "utf-8";' |
[wiki:Self:completesearch/Local Set up the system on a local machine (notebook)] |
Line 9: | Line 7: |
* $AC->settings->capitals is UTF-8 encoded * In ajax.php we UTF-8 encode the query string if $AC->settings->encoding is utf-8 and the charset in the request's content_type is not utf-8 (that is, the request was sent in a non-UTF-8 encoding) |
[wiki:Self:completesearch/IndexBuilding Index Building: Tools, Formats, etc.] |
Line 12: | Line 9: |
We have to support other encodings such as iso-8859-1 because some collections are not UTF-8 encoded. The default encoding can be overridden by setting $config->encoding in autocomplete_config.php. |
[wiki:Self:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.] |
Line 15: | Line 11: |
To handle non-UTF-8 encodings we do the following: * The page encoding of index.php, options.php and change_options.php is determined by $AC->settings->encoding ( <meta http-equiv="content-type" content="text/html;charset=<?php echo $AC->settings->encoding; ?>"> ) * Texts from text.php are UTF-8 decoded by $AC->get_text() * How function javascript_rhs (in generate_javascript.php) writes the JavaScript code depends on the encoding: if a non-UTF encoding is used, the JavaScript is generated URL encoded (this is not necessary if UTF-8 is used) * We use mb_strtolower (instead of strtolower) with parameter $AC->settings->encoding so that lowercasing works for UTF-8 text |
[wiki:Self:completesearch/CharacterEncoding UTF-8 support for PHP (php_mbstring extension)] [wiki:Self:completesearch/PHPErrorReporting Error reporting / handling with PHP] [wiki:Self:completesearch/Php Problems with PHP] == Documentation (developers) == [wiki:Self:completesearch/SourceCodeOverview Source code overview] [http://search.mpi-inf.mpg.de/ir-www/doc Documentation produced by Doxygen] [wiki:Self:CodingConventions Coding Conventions] and [wiki:Self:completesearch/DesignConventions OO and C++ Design Conventions] [wiki:Self:completesearch/Debugging Debugging HowTo] [wiki:Self:completesearch/GNUBuildSystem Building with autoconf/automake] and [wiki:Self:completesearch/CMakeBuildSystem Building with CMake] [wiki:Self:completesearch/MinGW Compiling under MinGW] [wiki:Self:completesearch/ExcerptGenerator Excerpt Generator requirements] [wiki:Self:completesearch/ExcerptGenerator/ThesisTopic Description for a Master's Thesis on Excerpt Generation] [wiki:Self:Projects/ExcerptGeneration Excerpt Generation Project Overview] [wiki:Self:completesearch/Templates Template peculiarities in the CompleteSearch code] [wiki:Self:completesearch/CVSHistory CVS history (TODO: what's this; is it still used?)] |
Line 25: | Line 42: |
== UTF-8 lowercase in PHP (23May07 Markus) == | == HowTos == |
Line 27: | Line 44: |
Requires the mbstring extension (for functions like mb_strtolower). The following line is required in php.ini: | [wiki:Self:completesearch/SeleniumRC Testing with SeleniumRC] |
Line 29: | Line 46: |
{{{ On Windows: extension=php_mbstring.dll |
[wiki:Self:completesearch/ModPhpStartetExe Have a click on a link start a local application] |
Line 33: | Line 48: |
or on Linux: extension=php_mbstring.so }}} (On geek, the mb_... functions were available by default; on Markus's laptop the line above had to be added.) |
[wiki:Self:completesearch/Examples Example programs etc.] |
Line 40: | Line 51: |
== Texts in text.php are now UTF-8 encoded (23May07 Markus) == | == ToDos == [wiki:Self:completesearch/TODO TODO list] [wiki:Self:NewFeatures New Features that would be nice to have] |
== Documentation (users) ==
[wiki:completesearch/Installation Installation Guide]
[wiki:completesearch/Local Set up the system on a local machine (notebook)]
[wiki:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]
[wiki:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]
[wiki:completesearch/CharacterEncoding UTF-8 support for PHP (php_mbstring extension)]
[wiki:completesearch/PHPErrorReporting Error reporting / handling with PHP]
[wiki:completesearch/Php Problems with PHP]
== Documentation (developers) ==
[wiki:completesearch/SourceCodeOverview Source code overview]
[http://search.mpi-inf.mpg.de/ir-www/doc Documentation produced by Doxygen]
[wiki:CodingConventions Coding Conventions] and [wiki:completesearch/DesignConventions OO and C++ Design Conventions]
[wiki:completesearch/Debugging Debugging HowTo]
[wiki:completesearch/GNUBuildSystem Building with autoconf/automake] and [wiki:completesearch/CMakeBuildSystem Building with CMake]
[wiki:completesearch/MinGW Compiling under MinGW]
[wiki:completesearch/ExcerptGenerator Excerpt Generator requirements]
[wiki:completesearch/ExcerptGenerator/ThesisTopic Description for a Master's Thesis on Excerpt Generation] [wiki:Projects/ExcerptGeneration Excerpt Generation Project Overview]
[wiki:completesearch/Templates Template peculiarities in the CompleteSearch code]
[wiki:completesearch/CVSHistory CVS history (TODO: what's this; is it still used?)]
== HowTos ==
[wiki:completesearch/SeleniumRC Testing with SeleniumRC]
[wiki:completesearch/ModPhpStartetExe Have a click on a link start a local application]
[wiki:completesearch/Examples Example programs etc.]
== ToDos ==
[wiki:completesearch/TODO TODO list]
[wiki:NewFeatures New Features that would be nice to have]