0
I Use This!
Moderate Activity
Analyzed 1 day ago. based on code collected 1 day ago.

Project Summary

Declarative large-scale machine learning (ML) that aims at flexible specification of ML algorithms and automatic generation of hybrid runtime plans ranging from single-node, in-memory computations, to distributed computations on Apache Hadoop and Apache Spark.
ML algorithms are expressed in an R-like or Python-like syntax that includes linear algebra primitives, statistical functions, and ML-specific constructs. This high-level language significantly increases the productivity of data scientists as it provides (1) full flexibility in expressing custom analytics, and (2) data independence from the underlying input formats and physical data representations. Automatic optimization according to data and cluster characteristics ensures both efficiency and scalability.

Tags

cluster distributed dml hadoop java machine_learning pydml python spark

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    nearly 1 in 3 companies have no process for identifying, tracking, or remediating known open source vulnerabilities
  • ...
    anyone with an Open Hub account can update a project's tags
  • ...
    use of OSS increased in 65% of companies in 2016
  • ...
    check out hot projects on the Open Hub

Languages

HTML
61%
Java
28%
JavaScript
7%
13 Other
4%

30 Day Summary

Jul 23 2025 — Aug 22 2025

12 Month Summary

Aug 22 2024 — Aug 22 2025
  • 382 Commits
    Up + 63 (19%) from previous 12 months
  • 40 Contributors
    Up + 4 (11%) from previous 12 months