Scalable Advanced Massive Online Analysis

New Release 0.2.0!
View on GitHub yahoo/samoa


SAMOA is a distributed streaming machine learning (ML) framework that contains a programming abstraction for distributed streaming ML algorithms.

SAMOA enables the development of new ML algorithms without dealing with the complexity of the underlying stream processing engines (SPEs), such as Apache Storm and Apache S4. SAMOA users can code a distributed streaming ML algorithm once and execute it on multiple SPEs.
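The "code once, run on multiple engines" idea can be illustrated with a small, language-neutral sketch. This is not SAMOA's API: the names `Processor`, `Engine`, `MajorityClassLearner`, `SequentialEngine`, and `SingleWorkerThreadEngine` are hypothetical and exist only for this illustration. The learning logic is written against one abstract interface, and two toy stand-ins for execution engines run that same logic unchanged.

```python
from abc import ABC, abstractmethod
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


class Processor(ABC):
    """Hypothetical unit of streaming logic, written once by the algorithm author."""

    @abstractmethod
    def process(self, event):
        """Consume one event from the stream and return one output."""


class Engine(ABC):
    """Hypothetical execution back end; each subclass stands in for one SPE."""

    @abstractmethod
    def run(self, processor, stream):
        """Feed every event of the stream to the processor, return the outputs in order."""


class MajorityClassLearner(Processor):
    """Toy streaming learner: predict the most frequent label seen so far,
    then update the label counts with the true label (test-then-train)."""

    def __init__(self):
        self.counts = Counter()

    def process(self, event):
        _features, label = event
        # Predict before learning from this event; None until a label has been seen.
        prediction = self.counts.most_common(1)[0][0] if self.counts else None
        self.counts[label] += 1
        return prediction


class SequentialEngine(Engine):
    """Runs the processor in the calling thread."""

    def run(self, processor, stream):
        return [processor.process(event) for event in stream]


class SingleWorkerThreadEngine(Engine):
    """Runs the processor on one background worker thread.
    A single worker keeps the event order, so the learner state stays consistent."""

    def run(self, processor, stream):
        with ThreadPoolExecutor(max_workers=1) as pool:
            return list(pool.map(processor.process, stream))


if __name__ == "__main__":
    stream = [((1.0,), "spam"), ((0.5,), "spam"), ((0.1,), "ham"), ((0.9,), "spam")]
    for engine in (SequentialEngine(), SingleWorkerThreadEngine()):
        # The same learner class runs on both engines without modification.
        print(type(engine).__name__, engine.run(MajorityClassLearner(), stream))
```

The point of the design is the separation of concerns: the learner only knows about `process`, so swapping the engine does not require touching the algorithm. A real SPE adds distribution, routing, and fault tolerance, none of which this sketch attempts to model.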


SAMOA Slides

G. De Francisci Morales. SAMOA: A Platform for Mining Big Data Streams. Keynote talk at RAMSS '13: 2nd International Workshop on Real-Time Analysis and Mining of Social Streams, WWW, Rio de Janeiro, 2013.


NoSQL matters Conference, Barcelona 2013

SAMOA Developer's Guide

SAMOA Developer's guide

API Javadoc Reference

Mailing lists

SAMOA users:

SAMOA developers:


List of contributors to the SAMOA project.


The use and distribution terms for this software are covered by the Apache License, Version 2.0.


Getting Started