How easy is it to get Lucene to use existing "stop word" and keyword lists, when it is building it's indexes?
Many thanks for putting the time into making this Open Source text retrieval system more accessible to us all.
Cheers, Bill Bug
Otis Gospodnetic
Author
Greenhorn
Joined: Dec 30, 2004
Posts: 23
posted
0
Hello Bill,
You can use a custom stop-word list together with StopFilter. StopFilter is a TokenFilter that you can include in your custom Analyzer. Lucene also comes with StopAnalyzer class, which already includes StopFilter, so you'll just have to pass your stop-word String array to its constructor.