We are growing and looking for smart people to join us either in an “elastic”, on-demand, per-project, or more permanent role:
Lucene/Solr expert who…
- Has built non-trivial applications with Lucene or Solr or Elastic Search, knows how to tune them, and can design systems for large volume of data and queries
- Is familiar with (some of the) internals of Lucene or Solr or Elastic Search, at least on the high level (yeah, a bit of an oxymoron)
- Has a systems/ops bent or knows how to use performance-related UNIX and JVM tools for analyzing disk IO, CPU, GC, etc.
Data Analytics expert who…
- Has used or built tools to process and analyze large volumes of data
- Has experience using HDFS and MapReduce, and have ideally also worked with HBase, or Pig, or Hive, or Cassandra, or Voldemort, or Cascading or…
- Has experience using Mahout or other similar tools
- Has interest or background in Statistics, or Machine Learning, or Data Mining, or Text Analytics or…
- Has interest in growing into a Lead role for the Data Analytics team
We like to dream that we can find a person who gets both Search and Data Analytics, and ideally wants or knows how to marry them.
Ideal candidates also have the ability to:
- Write articles on interesting technical topics (that may or may not relate to Lucene/Solr) on Sematext Blog or elsewhere
- Create and give technical talks/presentations (at conferences, local user groups, etc.)
Additional personal and professional traits we really like:
- Proactive and analytical: takes initiative, doesn’t wait to be asked or told what to do and how to do it
- Self-improving and motivated: acquires new knowledge and skills, reads books, follows relevant projects, keeps up with changes in the industry…
- Self-managing and organized: knows how to parcel work into digestible tasks, organizes them into Sprints, updates and closes them, keeps team members in the loop…
- Realistic: good estimator of time and effort (i.e. knows how to multiply by 2)
- Active in OSS projects: participates in open source community (e.g. mailing list participation, patch contribution…) or at least keeps up with relevant projects via mailing list or some other means
- Follows good development practices: from code style to code design to architecture
- Productive, gets stuff done: minimal philosophizing and over-designing
Here are some of the Search things we do (i.e. that you will do if you join us):
- Work with external clients on their Lucene/Solr projects. This may involve anything from performance troubleshooting to development of custom components, to designing highly scalable, high performance, fault-tolerant architectures. See our services page for common requests.
- Provide Lucene/Solr technical support to our tech support customers
- Work on search-related products and services
A few words about us:
We work with search and big data (Lucene, Solr, Nutch, Hadoop, MapReduce, HBase, etc.) on a daily basis. Our projects with external clients range from 1 week to several months. Some clients are small startups, some are large international organizations. Some are top secret. New customers knock on our door regularly and this keeps us busy at pretty much all times. When we are not busy with clients we work on our products. We run search-lucene.com and search-hadoop.com. We participate in open-source projects and publish monthly Digest posts that cover Lucene, Solr, Nutch, Mahout, Hadoop, and HBase. We don’t write huge spec docs, we work in sprints, we multitask, and try our best to be agile. We send people to conferences, trainings (Hadoop, HBase, Cassandra), and certifications (2 of our team members are Cloudera Certified Hadoop Developers).
We are a small and mostly office-free, highly distributed team that communicates via email, Skype voice/IM, BaseCamp. Some of our developers are in Eastern Europe, so we are especially open to new team members being in that area, but we are also interested in good people world-wide, from South America to Far East.
Interested? Please send your resume to jobs @ sematext.com.