aspose file tools*
The moose likes Other Open Source Projects and the fly likes Taming Text: some more examples of where it's useful? Big Moose Saloon
  Search | Java FAQ | Recent Topics | Flagged Topics | Hot Topics | Zero Replies
Register / Login
JavaRanch » Java Forums » Products » Other Open Source Projects
Bookmark "Taming Text: some more examples of where it Watch "Taming Text: some more examples of where it New topic

Taming Text: some more examples of where it's useful?

Michael Swierczek
Ranch Hand

Joined: Oct 07, 2005
Posts: 105
Mr. Ingersoll, Morton, and Farris:

Thank you for taking part in the book giveaway.

Aside from the question and answer system mentioned in Chapter 1, what other applications that a typical developer might build at work or for fun would use the features you describe?

Obviously if I'm going to build my own web search engine, or build a search engine for, say, a very detailed company wiki, your text would be useful.

I could also see it as a way to build a kind of grep++, a command line or graphical tool for very complex searches through large amounts of files.

Are there any other obvious or not so obvious applications of the technologies that I am missing?

Grant Ingersoll

Joined: Jan 03, 2013
Posts: 8

It's a great question and many of them are answered throughout the "core" chapters as well as in chapter 9, which discusses new and upcoming text applications.

Here are a few examples of things that can be built using the concepts in the book:
1. Sentiment analysis -- is this text positive or negative about a product/person/idea
2. Trend detection -- identifying what is trending in the news or in social media
3. Recommendation engine -- i.e. people who bought this also bought that.
4. Automatically identifying and extracting people, places, etc. from text
5. Classifying news into buckets like politics, sports, etc.

There are of course many others. At the end of the day, many of these techniques, esp. search, give you a real fast ranking engine, so any problem that needs ranking of top X items is a good candidate for search. Clustering, classification, named entity recognition are really good at helping you better organize unstructured content. They are also quite helpful in applications that have some component of text but aren't purely text based, like customer profile segmentation, etc.
Michael Swierczek
Ranch Hand

Joined: Oct 07, 2005
Posts: 105
Cool. Okay, that's genuinely interesting. I hadn't really thought of those applications.

Thank you for taking the time to answer.
I agree. Here's the link:
subject: Taming Text: some more examples of where it's useful?
Similar Threads
search engine using XML
how to build a search engine?
Hibernate Search in Action: What do you mean by Search exactly?
How to get started in using this feature?