Differences between revisions 7 and 66 (spanning 59 versions)
Revision 7 as of 2007-05-28 14:33:09
Size: 1823
Editor: dslb-084-058-237-229
Comment:
Revision 66 as of 2007-11-09 17:43:20
Size: 2054
Editor: dslb-084-058-239-098
Comment:
Line 1: Line 1:
= About character encoding =
== Documentation (users) ==
Line 3: Line 3:
In general we use the multibyte character encoding UTF-8 as the default encoding, with the following consequences:
[wiki:Self:completesearch/Installation Installation Guide]
Line 5: Line 5:
 * The $AC->settings->encoding is 'utf-8' unless overridden in autocomplete_config.php
 * The text.php is saved as UTF-8
 * The css file uses '@charset "utf-8";'
[wiki:Self:completesearch/Local Set up the system on a local machine (notebook)]
Line 9: Line 7:
 * $AC->settings->capitals is utf-8 encoded
 * In ajax.php the query string is UTF-8 encoded if $AC->settings->encoding is utf-8 and the charset of content_type is not utf-8 (that is, the request was sent in a non-UTF-8 encoding)
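The query-string rule above can be sketched as follows. This is only an illustration, not the actual code in ajax.php: the function name `query_to_utf8` and its parameters are hypothetical, and the sketch assumes that any request not declared as UTF-8 is ISO-8859-1. It needs the mbstring extension.

```php
<?php
// Hypothetical helper; the name and signature are assumptions for illustration.
function query_to_utf8(string $query, string $contentType, string $appEncoding = 'utf-8'): string
{
    // Convert only when the application itself runs in UTF-8.
    if (strtolower($appEncoding) !== 'utf-8') {
        return $query;
    }
    // Read the charset parameter from the Content-Type value, if present.
    if (preg_match('/charset=([\w-]+)/i', $contentType, $m)
        && strtolower($m[1]) === 'utf-8') {
        return $query; // the request is already UTF-8
    }
    // Assumption: a request not declared as UTF-8 is ISO-8859-1.
    return mb_convert_encoding($query, 'UTF-8', 'ISO-8859-1');
}
```

A request without any charset parameter is treated as non-UTF-8 here; whether ajax.php does the same is not stated in these notes.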
[wiki:Self:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]
Line 12: Line 9:
We have to support other encodings like iso-8859-1 because some collections are not utf-8 encoded.
The default encoding can be overridden by $config->encoding in the autocomplete_config.php.
[wiki:Self:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]
Line 15: Line 11:
To handle non-utf-8 encoding we do the following:
 * The page encoding of index.php, options.php and change_options.php is determined by $AC->settings->encoding (<meta http-equiv="content-type" content="text/html;charset=<?php echo $AC->settings->encoding; ?>">)
 * Texts from text.php are utf-8 decoded by $AC->get_text()
 * How function javascript_rhs (in generate_javascript.php) writes the JavaScript code depends on the encoding: if a non-UTF encoding is used, the JavaScript is generated URL-encoded (this is not necessary if UTF-8 is used)
 * We use mb_strtolower (instead of strtolower) with parameter $AC->settings->encoding to enable utf-8
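As a minimal sketch of the last point, the following shows mb_strtolower being called with an explicit encoding. The wrapper name `lowercase_in` is hypothetical and the `$encoding` variable merely stands in for $AC->settings->encoding. It needs the mbstring extension.

```php
<?php
// Hypothetical wrapper; in the application the second argument would be
// $AC->settings->encoding.
function lowercase_in(string $text, string $encoding): string
{
    // mb_strtolower interprets the bytes of $text in the given encoding,
    // so multibyte characters such as "Ä" are lowercased as whole characters.
    return mb_strtolower($text, $encoding);
}

$encoding = 'utf-8';             // stand-in for $AC->settings->encoding
$word = "\xC3\x84RGER";          // "ÄRGER" as UTF-8 bytes
$lower = lowercase_in($word, $encoding);
```

The same call also covers collections in other encodings, for example `lowercase_in("\xC4RGER", 'ISO-8859-1')` for the single-byte form of the same word.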
[wiki:Self:completesearch/CharacterEncoding UTF-8 support for PHP (php_mbstring extension)]

[wiki:Self:completesearch/PHPErrorReporting Error reporting / handling with PHP]

[wiki:Self:completesearch/Php Problems with PHP]

[wiki:Self:completesearch/NotesToJavascript Notes to Javascript]


== Documentation (developers) ==

[wiki:Self:completesearch/SourceCodeOverview Source code overview]

[http://search.mpi-inf.mpg.de/ir-www/doc Documentation produced by Doxygen]

[wiki:Self:CodingConventions Coding Conventions] and [wiki:Self:completesearch/DesignConventions OO and C++ Design Conventions]

[wiki:Self:completesearch/Debugging Debugging HowTo]

[wiki:Self:completesearch/GNUBuildSystem Building with autoconf/automake] and [wiki:Self:completesearch/CMakeBuildSystem Building with CMake]

[wiki:Self:completesearch/MinGW Compiling under MinGW]

[wiki:Self:completesearch/ExcerptGenerator Excerpt Generator requirements]

[wiki:Self:completesearch/ExcerptGenerator/ThesisTopic Description for a Master's Thesis on Excerpt Generation] [wiki:Self:Projects/ExcerptGeneration Excerpt Generation Project Overview]

[wiki:Self:completesearch/Templates Template peculiarities in the CompleteSearch code]

[wiki:Self:completesearch/CVSHistory CVS history (TODO: what's this; is it still used?)]
Line 25: Line 44:
== UTF-8 lowercase in PHP (23May07 Markus) ==
== HowTos ==
Line 27: Line 46:
Requires the mbstring extension (for functions like mb_strtolower). The following line is required in php.ini:
[wiki:Self:completesearch/SeleniumRC Testing with SeleniumRC]
Line 29: Line 48:
{{{
On Windows:
extension=php_mbstring.dll
or on Linux:
extension=php_mbstring.so
}}}
[wiki:Self:completesearch/ModPhpStartetExe Have a click on a link start a local application]
Line 33: Line 50:

(On geek, the mb_... functions were available by default; on Markus's laptop the line above had to be added.)
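Whether a given machine already provides the mb_... functions can be checked from PHP itself. The following is a small sketch; the function name `mbstring_available` is hypothetical, while `extension_loaded` and `function_exists` are standard PHP functions.

```php
<?php
// Hypothetical check; returns true when the mbstring extension is loaded
// and mb_strtolower is callable.
function mbstring_available(): bool
{
    return extension_loaded('mbstring') && function_exists('mb_strtolower');
}

if (!mbstring_available()) {
    // Report the problem instead of failing later with an undefined function.
    echo "mbstring is not loaded; add the extension line to php.ini\n";
}
```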
[wiki:Self:completesearch/Examples Example programs etc.]
Line 40: Line 53:
== Texts in text.php are now UTF-8 encoded (23May07 Markus) ==
== ToDos ==

[wiki:Self:completesearch/TODO TODO list]

[wiki:Self:NewFeatures New Features that would be nice to have]

== Documentation (users) ==

[wiki:completesearch/Installation Installation Guide]

[wiki:completesearch/Local Set up the system on a local machine (notebook)]

[wiki:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]

[wiki:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]

[wiki:completesearch/CharacterEncoding UTF-8 support for PHP (php_mbstring extension)]

[wiki:completesearch/PHPErrorReporting Error reporting / handling with PHP]

[wiki:completesearch/Php Problems with PHP]

[wiki:completesearch/NotesToJavascript Notes to Javascript]

== Documentation (developers) ==

[wiki:completesearch/SourceCodeOverview Source code overview]

[http://search.mpi-inf.mpg.de/ir-www/doc Documentation produced by Doxygen]

[wiki:CodingConventions Coding Conventions] and [wiki:completesearch/DesignConventions OO and C++ Design Conventions]

[wiki:completesearch/Debugging Debugging HowTo]

[wiki:completesearch/GNUBuildSystem Building with autoconf/automake] and [wiki:completesearch/CMakeBuildSystem Building with CMake]

[wiki:completesearch/MinGW Compiling under MinGW]

[wiki:completesearch/ExcerptGenerator Excerpt Generator requirements]

[wiki:completesearch/ExcerptGenerator/ThesisTopic Description for a Master's Thesis on Excerpt Generation] [wiki:Projects/ExcerptGeneration Excerpt Generation Project Overview]

[wiki:completesearch/Templates Template peculiarities in the CompleteSearch code]

[wiki:completesearch/CVSHistory CVS history (TODO: what's this; is it still used?)]

== HowTos ==

[wiki:completesearch/SeleniumRC Testing with SeleniumRC]

[wiki:completesearch/ModPhpStartetExe Have a click on a link start a local application]

[wiki:completesearch/Examples Example programs etc.]

== ToDos ==

[wiki:completesearch/TODO TODO list]

[wiki:NewFeatures New Features that would be nice to have]

CompleteSearch: FrontPage (last edited 2017-03-19 13:30:19 by Hannah Bast)