This week's book giveaway is in the Clojure forum.
We're giving away four copies of Clojure in Action and have Amit Rathore and Francis Avila on-line!
See this thread for details.
Win a copy of Clojure in Action this week in the Clojure forum!
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic

Taming Text: what's it all about?

 
Bear Bibeault
Author and ninkuma
Marshal
Pie
Posts: 64192
83
IntelliJ IDE Java jQuery Mac Mac OS X
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
As I had no idea what a book titled Taming Text might be about (other than "taming text"), I thought I'd include the descrption of the book from its Manning page:

SUMMARY
Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.

ABOUT THIS BOOK
There is so much text in our lives, we are practically drowning in it. Fortunately, there are innovative tools and techniques for managing unstructured information that can throw the smart developer a much-needed lifeline. You'll find them in this book.

Taming Text is a practical, example-driven guide to working with text in real applications. This book introduces you to useful techniques like full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. You'll explore real use cases as you systematically absorb the foundations upon which they are built.

Written in a clear and concise style, this book avoids jargon, explaining the subject in terms you can understand without a background in statistics or natural language processing. Examples are in Java, but the concepts can be applied in any language.

WHAT'S INSIDE
When to use text-taming techniques
Important open-source libraries like Solr and Mahout
How to build text-processing applications
 
Ulf Dittmer
Rancher
Pie
Posts: 42966
73
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
And one of the authors is Grant Ingersoll, one of the brains behind the Lucene library; 'nuff said.
 
Grant Ingersoll
Author
Greenhorn
Posts: 8
  • 0
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
I like to think of Taming Text as an engineer's practical guide to text-based applications like search and natural language processing with examples using popular open source tools like Apache Solr and Lucene. In many ways, we (the authors) wanted to write the book we all wish we had way back when we started in this field. Our target audience is people who are new to text-based applications or new to the tools we introduce. If you are looking for a hard core explanation of the math underlying this stuff, this book is likely not for you. However, if you want a good sense of the concepts and issues involved with building a real text-based application, we hope you will find the book useful.
 
I agree. Here's the link: http://aspose.com/file-tools
  • Post Reply
  • Bookmark Topic Watch Topic
  • New Topic