Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Kettle

Compare

  Analyzed 1 day ago

K.E.T.T.L.E (Kettle ETTL Environment) is a meta-data driven ETTL tool. (ETTL: Extraction, Transformation, Transportation & Loading) This means that no code has to be written to perform complex data transformations. Environment means that it is possible to create plugins to do custom ... [More] transformations or access proprietary data sources. Kettle is released under the Apache License v2.0 since version 4.3.0. Kettle moved to GitHub at the beginning of 2014: http://github.com/pentaho/pentaho-kettle This move could lead to strange results in the Ohloh metrics. [Less]

1.25M lines of code

0 current contributors

1 day since last commit

23 users on Open Hub

High Activity
4.54545
   
I Use This

OpenJUMP

Compare

  Analyzed 1 day ago

OpenJUMP is an open source GIS software written in Java. It is based on JUMP GIS by Vivid Solutions. It is a Vector GIS that can read rasters as well. It is not just another free demo viewer, but you can edit, save, analyze etc. with OpenJUMP. It works, even with medium size datasets, and with ... [More] professional touc. It provides a GIS API with a flexible plugin structure, so that new features are relatively easy to develope around the sound mapping platform. It utilises standards like GML, WMS and WFS. It is already translated in several languages. [Less]

216K lines of code

0 current contributors

about 2 months since last commit

20 users on Open Hub

Low Activity
4.0
   
I Use This

OpenRefine

Compare

  Analyzed about 11 hours ago

OpenRefine is a free, open source power tool for working with messy data and improving it

185K lines of code

45 current contributors

1 day since last commit

4 users on Open Hub

Moderate Activity
5.0
 
I Use This

Teiid

Compare

Claimed by JBoss Analyzed about 6 hours ago

Teiid is The Enterprise Information Integration (virtual) Database. Teiid is a data virtualization system that allows applications to use data from hetergenous data sources. Teiid is comprised of tools, components and services for creating and executing bi-directional data services. Through ... [More] abstraction and federation, data is accessed and integrated in real-time across distributed data sources without copying or otherwise moving data from its system of record. [Less]

430K lines of code

7 current contributors

over 3 years since last commit

3 users on Open Hub

Inactive
0.0
 
I Use This

BioMart

Compare

  No analysis available

BioMart is a query-oriented data management system developed jointly by the European Bioinformatics Institute (EBI) and Cold Spring Harbor Laboratory (CSHL). The system can be used with any type of data and comes with a range of query interfaces and administration tools, including 'out of the ... [More] box' website that can be installed, configured and customised according to requirements. The system simplifies the task of creation and maintenance of advanced query interfaces backed by a relational database and it is particularly suited for providing the 'data mining' like searches of complex descriptive (e.g. biological) data. BioMart can work with existing data repositories by converting them to a required BioMart format as well as newly created databases. [Less]

0 lines of code

0 current contributors

0 since last commit

2 users on Open Hub

Activity Not Available
4.0
   
I Use This
Mostly written in language not available
Licenses: lgpl

BioDAS

Compare

  No analysis available

The Distributed Annotation System (DAS) defines a communication protocol used to exchange biological sequence annotations. DAS is a client-server system in which a single client integrates data from multiple servers. Data distribution, performed by DAS servers, is separated from visualization ... [More] , which is done by DAS clients. Little coordination is needed among the various information providers. DAS is heavily used in the genome bioinformatics community for sharing information about gene and protein sequences as well as protein structures. [Less]

0 lines of code

0 current contributors

0 since last commit

2 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: No declared licenses

Apache Daffodil

Compare

  Analyzed 1 day ago

Apache Daffodil - an implementation of the Data Format Description Language (DFDL)

552K lines of code

10 current contributors

8 days since last commit

1 users on Open Hub

High Activity
5.0
 
I Use This

Talend Open Studio General Purpose Components Collection

Compare

  Analyzed about 12 hours ago

A collection of general purpose maven-driven TOS components with various intended uses, from social network analysis to webservice connectors to tweet parsing.

10K lines of code

0 current contributors

about 8 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

professional-services-data-validator

Compare

  Analyzed about 3 hours ago

Utility to compare data between homogeneous or heterogeneous environments to ensure source and target tables match. The Data Validation Tool is an open sourced Python CLI tool based on the Ibis framework that compares heterogeneous data source tables with multi-leveled validation functions. ... [More] Data validation is a critical step in a data warehouse, database, or data lake migration project where data from both the source and the target tables are compared to ensure they are matched and correct after each migration step (e.g. data and schema migration, SQL script translation, ETL migration, etc.). The Data Validation Tool (DVT) provides an automated and repeatable solution to perform this task. [Less]

32.3K lines of code

0 current contributors

about 17 hours since last commit

0 users on Open Hub

Moderate Activity
0.0
 
I Use This
Licenses: No declared licenses

Talend Spatial extension

Compare

  Analyzed 1 day ago

Talend Spatial extension

108K lines of code

0 current contributors

about 10 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This