clear query| facets| time Search criteria: author:"Jeff Eastman".   Results from 41 to 50 from 1288 (0.0s).
Loading phrases to help you
refine your search...
[expand - 10 more] - Clustering from DB - Mahout - [mail # user]
...It's been over a year since I ran any tests of KMeans on larger data  sets and there has been a lot of refactoring done in the interim. I was  also using only dense vectors. It is ...
   Author: Jeff Eastman , 2009-07-27, 14:07
[expand - 1 more] - mahout clusterdump output - Mahout - [mail # user]
...Mattie is correct on the VL/CL notation. Convergence; however, does not  mean that the cluster centers have stopped moving, only that their  movement is below a certain threshold. ...
   Author: Jeff Eastman , 2012-09-11, 14:28
Choosing appropriate values for T1 and T2 for canopy clustering - Mahout - [mail # user]
...The T2 value you select will determine the number of clusters you get. The T1 value determines how much points which are near to each cluster will influence it in the final centroid calculat...
   Author: Jeff Eastman , 2011-04-13, 16:04
[expand - 1 more] - kmeans - Mahout - [mail # user]
...Suggest you look at the wiki and the examples in the code base. You really need to provide more information in order to get more focused answers. Have you read Mahout in Action? An excellent...
   Author: Jeff Eastman , 2011-03-07, 17:40
Introduction to Apache Mahout K-means clustering - Mahout - [mail # user]
...See the response "Re: Clustering without hadoop" by Johannes Schulte two  postings earlier than yours on user@m.a.o. The driver functions can also  be run in sequential mode from a...
   Author: Jeff Eastman , 2012-11-13, 13:56
[expand - 1 more] - Issue: Canopy is processing extremly slow, what goes wrong? - Mahout - [mail # user]
...Canopy is very sensitive to the value of T2: Too small a value will  cause the creation of very many canopies in each mapper and these will  swamp the reducer.  I suggest you ...
   Author: Jeff Eastman , 2012-11-13, 14:01
[expand - 1 more] - String clustering and other newbie questions - Mahout - [mail # user]
...Well, all of the clustering code is based upon clustering points in an  n-dimensional vector space and all of the APIs operate upon Vectors. We  do support the ability to attach a ...
   Author: Jeff Eastman , 2009-08-28, 18:09
Submitting mahout jobs to map/reduce cluster with fair scheduling - Mahout - [mail # user]
...That Job extends org.apache.mahout.common.AbstractJob, so it probably  will accept a -D argument to set "mapred.fairscheduler.pool=..." . Have  you tried this?   On 11/8/12 3:...
   Author: Jeff Eastman , 2012-11-09, 01:11
[expand - 1 more] - Is the implementation of CIMapper thread safe ? - Mahout - [mail # user]
...Hi Yunming,  The problem I see with what you are proposing is that Hadoop only gives  you a single input vector per call of CIMapper.map(). Using multiple  threads to perform ...
   Author: Jeff Eastman , 2012-12-21, 16:48
[expand - 1 more] - About Dirichlet clustering's threshold - Mahout - [mail # user]
...Here's a response to a similar question from a couple of months ago:  The classification phase of Dirichlet uses a most-likely assignment of  points to clusters by default. This me...
   Author: Jeff Eastman , 2012-12-25, 20:44