0
I Use This!
Moderate Activity
Analyzed about 13 hours ago. based on code collected about 15 hours ago.

Project Summary

Declarative large-scale machine learning (ML) that aims at flexible specification of ML algorithms and automatic generation of hybrid runtime plans ranging from single-node, in-memory computations, to distributed computations on Apache Hadoop and Apache Spark.
ML algorithms are expressed in an R-like or Python-like syntax that includes linear algebra primitives, statistical functions, and ML-specific constructs. This high-level language significantly increases the productivity of data scientists as it provides (1) full flexibility in expressing custom analytics, and (2) data independence from the underlying input formats and physical data representations. Automatic optimization according to data and cluster characteristics ensures both efficiency and scalability.

Tags

cluster distributed dml hadoop java machine_learning pydml python spark

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    65% of companies leverage OSS to speed application development in 2016
  • ...
    compare projects before you chose one to use
  • ...
    there are over 3,000 projects on the Open Hub with security vulnerabilities reported against them
  • ...
    by exploring contributors within projects, you can view details on every commit they have made to that project

Languages

HTML
60%
Java
29%
JavaScript
7%
13 Other
4%

30 Day Summary

Feb 2 2026 — Mar 4 2026

12 Month Summary

Mar 4 2025 — Mar 4 2026
  • 221 Commits
    Down -191 (46%) from previous 12 months
  • 40 Contributors
    Down 0 (0%) from previous 12 months