Differences between revisions 8 and 65 (spanning 57 versions)
Revision 8 as of 2007-05-28 14:43:29
Size: 1819
Editor: dslb-084-058-237-229
Comment:
Revision 65 as of 2007-11-06 20:16:58
Size: 1986
Editor: 216
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= About character encoding = == Documentation (users) ==
Line 3: Line 3:
In general we use the multibyte character encoding UTF-8 as default encoding with the follwing consequences: [wiki:Self:completesearch/Installation Installation Guide]
Line 5: Line 5:
 * The $AC->settings->encoding is 'utf-8' unless overriden in autocomplete_config.php
 * The text.php is saved as UTF-8
 * The css file uses '@charset "utf-8";'
 * $AC->settings->capitals is utf-8 encoded
 * In ajax.php we utf-8 encode the query string if $AC->settings->encoding is utf-8 and the charset of content_type is not utf-8 (means the request is sent in a non-utf-8 type)
 * We use mb_strtolower (instead of strtolower) with parameter $AC->settings->encoding to enable utf-8
[wiki:Self:completesearch/Local Set up the system on a local machine (notebook)]
Line 12: Line 7:
We have to support other encodings like iso-8859-1 because some collections are not utf-8 encoded.
The default encoding can be overriden by $config->encoding in the autocomplete_config.php.
[wiki:Self:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]
Line 15: Line 9:
If the encoding is not UTF-8 we do the following:
 * the page encoding of index.php, options.php and change_options.php is determined by $AC->settings->encoding
(<meta http-equiv="content-type" content="text/html;charset=<?php echo $AC->settings->encoding; ?>">
)
 * Texts from text.php are utf-8 decoded by $AC->get_text()
 * The way to write the javascript code in function javascript_rhs (in generate_javascript.php) depends on encoding: if no utf encoding is given the javascript is generated url encoded (this is not necessary if utf-8 is used)
[wiki:Self:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]

[wiki:Self:completesearch/CharacterEncoding UTF-8 support for PHP (php_mbstring extension)]

[wiki:Self:completesearch/PHPErrorReporting Error reporting / handling with PHP]

[wiki:Self:completesearch/Php Problems with PHP]


== Documentation (developers) ==

[wiki:Self:completesearch/SourceCodeOverview Source code overview]

[http://search.mpi-inf.mpg.de/ir-www/doc Documentation produced by Doxygen]

[wiki:Self:CodingConventions Coding Conventions] and [wiki:Self:completesearch/DesignConventions OO and C++ Design Conventions]

[wiki:Self:completesearch/Debugging Debugging HowTo]

[wiki:Self:completesearch/GNUBuildSystem Building with autoconf/automake] and [wiki:Self:completesearch/CMakeBuildSystem Building with CMake]

[wiki:Self:completesearch/MinGW Compiling under MinGW]

[wiki:Self:completesearch/ExcerptGenerator Excerpt Generator requirements]

[wiki:Self:completesearch/ExcerptGenerator/ThesisTopic Description for a Master's Thesis on Excerpt Generation] [wiki:Self:Projects/ExcerptGeneration Excerpt Generation Project Overview]

[wiki:Self:completesearch/Templates Template peculiarities in the CompleteSearch code]

[wiki:Self:completesearch/CVSHistory CVS history (TODO: what's this; is it still used?)]
Line 24: Line 42:
== UTF-8 lowercase in PHP (23May07 Markus) == == HowTos ==
Line 26: Line 44:
Requires extension mbstring (for functions like mb_strtolower). Following line required in php.ini [wiki:Self:completesearch/SeleniumRC Testing with SeleniumRC]
Line 28: Line 46:
{{{
In windows:
extension=php_mbstring.dll
[wiki:Self:completesearch/ModPhpStartetExe Have a click on a link start a local application]
Line 32: Line 48:
or in linux:
extension=php_mbstring.so
}}}

(On geek, the mb_... functions were available by default, on Markus's laptop the line above had to be added.)
[wiki:Self:completesearch/Examples Example programs etc.]
Line 39: Line 51:
== Texts in text.php are now UTF-8 encoded (23May07 Markus) ==
== ToDos ==

[wiki:Self:completesearch/TODO TODO list]

[wiki:Self:NewFeatures New Features that would be nice to have]

Documentation (users)

[wiki:completesearch/Installation Installation Guide]

[wiki:completesearch/Local Set up the system on a local machine (notebook)]

[wiki:completesearch/IndexBuilding Index Building: Tools, Formats, etc.]

[wiki:completesearch/DocumentFormats Document Formats: .docs, .words, .vocabulary, etc.]

[wiki:completesearch/CharacterEncoding UTF-8 support for PHP (php_mbstring extension)]

[wiki:completesearch/PHPErrorReporting Error reporting / handling with PHP]

[wiki:completesearch/Php Problems with PHP]

Documentation (developers)

[wiki:completesearch/SourceCodeOverview Source code overview]

[http://search.mpi-inf.mpg.de/ir-www/doc Documentation produced by Doxygen]

[wiki:CodingConventions Coding Conventions] and [wiki:completesearch/DesignConventions OO and C++ Design Conventions]

[wiki:completesearch/Debugging Debugging HowTo]

[wiki:completesearch/GNUBuildSystem Building with autoconf/automake] and [wiki:completesearch/CMakeBuildSystem Building with CMake]

[wiki:completesearch/MinGW Compiling under MinGW]

[wiki:completesearch/ExcerptGenerator Excerpt Generator requirements]

[wiki:completesearch/ExcerptGenerator/ThesisTopic Description for a Master's Thesis on Excerpt Generation] [wiki:Projects/ExcerptGeneration Excerpt Generation Project Overview]

[wiki:completesearch/Templates Template peculiarities in the CompleteSearch code]

[wiki:completesearch/CVSHistory CVS history (TODO: what's this; is it still used?)]

HowTos

[wiki:completesearch/SeleniumRC Testing with SeleniumRC]

[wiki:completesearch/ModPhpStartetExe Have a click on a link start a local application]

[wiki:completesearch/Examples Example programs etc.]

ToDos

[wiki:completesearch/TODO TODO list]

[wiki:NewFeatures New Features that would be nice to have]

CompleteSearch: FrontPage (last edited 2017-03-19 13:30:19 by Hannah Bast)