internationalization - Culture-independent stemmer/analyzer for Lucene.NET -

We are currently developing a full-text search-enabled app and we are our weapons of the Lucene.NET option. What is expected is that any app will be used by people from different countries, so it should be able to search equally well in Russian, English and other texts.

Are both universal and culture independent, stomers and analysts meet our needs? I think we will eventually have to use culture-specific people, but we want to get up from this potentially quick and dirty view.

Given that spelling, grammar and character sets of English and Russian are quite different, any punishments The one who tries to do both, is either large or poor performance (most likely both of them) on a large scale.

This would be better for using a stammer for each language, and to use any UI clues (which are being used for query queries) or on the basis of clear selection Will be chosen.

After saying this, it is not possible that in any Russian text the English search term will match the word true or vice versa.

It looks like a case where a little more than the business analysis code will help.

Braylock

Search This Blog

internationalization - Culture-independent stemmer/analyzer for Lucene.NET -

Comments

Post a Comment