The indexer understands the following special HTML characters:
&lt; &gt; &amp; &quot; (the characters < > & ")
All HTML 4 character entities, for example &auml; (ä) and &uuml; (ü).
Characters in Unicode code point notation, for example &#234; (ê).
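
The three classes above can be illustrated with a short sketch. It does not show the indexer's own decoder, which is not described here; it uses Python's standard html.unescape as a stand-in, and the sample strings are assumptions chosen to match the examples in the list.

```python
import html

# Sample inputs, one per class of special character listed above.
# These strings are illustrative assumptions, not taken from the indexer.
samples = [
    "&lt; &gt; &amp; &quot;",  # the four basic markup entities
    "&auml; &uuml;",           # named HTML 4 character entities
    "&#234;",                  # numeric reference to a Unicode code point
]

# html.unescape converts named and numeric references to characters.
decoded = [html.unescape(s) for s in samples]

for source, text in zip(samples, decoded):
    print(f"{source} -> {text}")
```

An indexer that performs this decoding before tokenizing would store the word "für" identically whether the page wrote it literally or as f&uuml;r.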