- MAHOUT-1464 Cooccurrence Analysis on Spark 
- MAHOUT-1578 Optimizations in matrix serialization 
- MAHOUT-1572 blockify() to detect (naively) the data sparsity in the loaded data 
- MAHOUT-1571 Functional Views are not serialized as dense/sparse correctly 
- MAHOUT-1566 (Experimental) Regular ALS factorizer with conversion tests, optimizer enhancements and bug fixes 
- MAHOUT-1537 Minor fixes to spark-shell 
- MAHOUT-1529 Finalize abstraction of distributed logical plans from backend operations 
- MAHOUT-1489 Interactive Scala & Spark Bindings Shell & Script processor 
- MAHOUT-1346 Spark Bindings 
- MAHOUT-1555 Exception thrown when a test example has the label not present in training examples 
- MAHOUT-1446 Create an intro for matrix factorization 
- MAHOUT-1480 Clean up website on 20 newsgroups 
- MAHOUT-1561 cluster-syntheticcontrol.sh not running locally with MAHOUT_LOCAL=true 
- MAHOUT-1558 Clean up classify-wiki.sh and add in a binary classification problem 
- MAHOUT-1560 Last batch is not filled correctly in MultithreadedBatchItemSimilarities 
- MAHOUT-1554 Provide more comprehensive classification statistics 
- MAHOUT-1548 Fix broken links in quickstart webpage 
- MAHOUT-1542 Tutorial for playing with Mahout's Spark shell 
- MAHOUT-1533 Remove Frequent Pattern Mining 
- MAHOUT-1532 Add solve() function to the Scala DSL 
- MAHOUT-1530 Custom prompt and welcome message for the Spark Shell 
- MAHOUT-1527 Fix wikipedia classifier example 
- MAHOUT-1526 Ant file in examples 
- MAHOUT-1523 Remove @author tags in sparkbindings 
- MAHOUT-1521 lucene2seq - Error trying to load data from stored field (when non-indexed) 
- MAHOUT-1520 Fix links in Mahout website documentation 
- MAHOUT-1519 Remove StandardThetaTrainer 
- MAHOUT-1517 Remove casts to int in ALSWRFactorizer 
- MAHOUT-1513 Deprecate Canopy Clustering 
- MAHOUT-1511 Renaming core to mrlegacy 
- MAHOUT-1510Goodbye MapReduce 
- MAHOUT-1509 Invalid URL in link from "quick start/basics" page 
- MAHOUT-1508 Performance problems with sparse matrices 
- MAHOUT-1505 structure of clusterdump's JSON output 
- MAHOUT-1504 Enable/fix thetaSummer job in TrainNaiveBayesJob 
- MAHOUT-1503 TestNaiveBayesDriver fails in sequential mode 
- MAHOUT-1502 Update Naive Bayes Webpage to Current Implementation 
- MAHOUT-1501 ClusterOutputPostProcessorDriver has private default constructor 
- MAHOUT-1498 DistributedCache.setCacheFiles in DictionaryVectorizer overwrites jars pushed using oozie 
- MAHOUT-1497 mahout resplit not producing splited files 
- MAHOUT-1496 Create a website describing the distributed ALS recommender 
- MAHOUT-1491 Spectral KMeans Clustering doesn't clean its /tmp dir and fails when seeing it again 
- MAHOUT-1488 DisplaySpectralKMeans fails: examples/output/clusteredPoints/part-m-00000 does not exist 
- MAHOUT-1483 Organize links in web site navigation bar 
- MAHOUT-1482 Rework quickstart website 
- MAHOUT-1476 Cleanup website on Hidden Markov Models 
- MAHOUT-1475 Cleanup website on Naive Bayes 
- MAHOUT-1472 Cleanup website on fuzzy kmeans 
- MAHOUT-1471 Cleanup website for Canopy clustering 
- MAHOUT-1468 Creating a new page for StreamingKMeans documentation on mahout website 
- MAHOUT-1467 ClusterClassifier readPolicy leaks file handles 
- MAHOUT-1466 Cluster visualization fails to execute 
- MAHOUT-1465 Clean up README 
- MAHOUT-1463 Modify OnlineSummarizers to use the TDigest dependency from Maven Central 
- MAHOUT-1460 Remove reference to Dirichlet in ClusterIterator 
- MAHOUT-1459 Move Hadoop related code out of CanopyClusterer 
- MAHOUT-1458 Remove KMeansConfigKeys and FuzzyKMeansConfigKeys 
- MAHOUT-1457 Move EigenSeedGenerator into spectral kmeans package 
- MAHOUT-1455 Forkcount config causes JVM crashes during build 
- MAHOUT-1451 Cleaning up the examples for clustering on the website 
- MAHOUT-1450 Cleaning up clustering documentation on mahout website 
- MAHOUT-1449 Update the Known Issues in Random Forests Page 
- MAHOUT-1448 In Random Forest, the training does not support multiple input files. The input dataset must be one single file. 
- MAHOUT-1447 ImplicitFeedbackAlternatingLeastSquaresSolver tests and features 
- MAHOUT-1445 Create an intro for item based recommender 
- MAHOUT-1440 Add option to set the RNG seed for inital cluster generation in Kmeans/fKmeans 
- MAHOUT-1438 "quickstart" tutorial for building a simple recommender 
- MAHOUT-1434 Dead links on the web site 
- MAHOUT-1433 Make SVDRecommender look at all unknown items of a user per default 
- MAHOUT-1429 Parallelize YtransposeY in ImplicitFeedbackAlternatingLeastSquaresSolver 
- MAHOUT-1428 Recommending already consumed items 
- MAHOUT-1425 SGD classifier example with bank marketing dataset. 
- MAHOUT-1420 Add solr-recommender to examples 
- MAHOUT-1419 Random decision forest is excessively slow on numeric features 
- MAHOUT-1417 Random decision forest implementation fails in Hadoop 2 
- MAHOUT-1416 Make access of DecisionForest.read(dataInput) less restricted 
- MAHOUT-1415 Clone method on sparse matrices fails if there is an empty row which has not been set explicitly 
- MAHOUT-1413 Rework Algorithms page 
- MAHOUT-1388 Add command line support and logging for MLP 
- MAHOUT-1385 Caching Encoders don't cache 
- MAHOUT-1356 Ensure unit tests fail fast when writing outside mvn target directory 
- MAHOUT-1329 Mahout for hadoop 2 
- MAHOUT-1310 Mahout support windows 


