Here it is – Sematext’s new and shiny blog.
We’ll be writing about topics that are dear and important to us – search (both web search and enterprise search), text analytics, natural language processing (sentiment detection, named entity recognition…), machine learning, information gathering (e.g. web crawling), information extraction, e-discovery, recommendation engines, etc. There will be a lot of talk about tools we use regularly – Lucene, Solr, Nutch, Mahout and Taste, Hadoop, HBase and friends, and more.
To subscribe, use the orange feed icon or just go to http://feeds.feedburner.com/SematextBlog. If you use Twitter, you can follow @sematext there, too.