Mahout: brief description

Mahout: brief description
Svetlana Komarova

Svetlana Komarova

The author. System Administrator, Oracle DBA. Information technology, internet, telecom. More details.

Mahout is an open source framework that can run common machine learning algorithms on massive datasets. To achieve that scalability, most of the code is written as parallelizable jobs on top of Hadoop. It comes with algorithms to perform a lot of common tasks, like clustering and classifying objects into groups, recommending items based on other users’ behaviors, and spotting attributes that occur together a lot. In practical terms, the framework makes it easy to use analysis techniques to implement features such as Amazon’s “People who bought this also bought” recommendation engine on your own site.

It’s a heavily used project with an active community of developers and users, and it’s well worth trying if you have any significant number of transaction or similar data that you’d like to get more value out of.


Вас заинтересует / Intresting for you:

Collecting Data in Real Time, ...
Collecting Data in Real Time, ... 532 views Валерий Павлюков Wed, 13 May 2020, 05:06:04
Business Analytics: IT Use Cas...
Business Analytics: IT Use Cas... 1007 views Даниил Fri, 05 Oct 2018, 16:48:44
BSON: brief description
BSON: brief description 830 views Светлана Комарова Wed, 09 Oct 2019, 08:02:39
Introduction to OLAP: basic co...
Introduction to OLAP: basic co... 3095 views Akmaral Wed, 03 Jan 2018, 06:21:04