Tags : Browse Projects

Apache Tajo

Claimed by Apache Software Foundation

Analyzed about 9 hours ago

A Distributed Data Warehouse System for Hadoop

257K lines of code

0 current contributors

over 5 years since last commit

3 users on Open Hub

Inactive
0.0

infovore

  Analyzed about 9 hours ago

RDF-Centric Map/Reduce Framework and Freebase data conversion tool

16.6K lines of code

0 current contributors

almost 11 years since last commit

2 users on Open Hub

Inactive
5.0

Apache Sqoop

Claimed by Apache Software Foundation

Analyzed 7 months ago

Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

328K lines of code

10 current contributors

over 4 years since last commit

2 users on Open Hub

Activity Not Available
0.0

Stetl - Streaming ETL

  Analyzed 1 day ago

Stetl provides a toolset for streaming ETL of geospatial data. Stetl uses existing transformation tools like GDAL/OGR and XSLT, glued together through Python. A config file specifies the ETL chain of modules. Stetl is speed-optimized by using native calls like ogr2ogr, libxml and libxslt (via lxml). Stetl is particularly meant for ETL cases where either ogr2ogr or XSLT alone is not sufficient. This mostly involves schema transformations as required for INSPIRE and local GML-based complex datasets. Stetl has proven able to handle tens of millions of features/records through a technique dubbed "gml-splitting" (and outputting into a deegree blobstore, which could be dubbed "gml-spitting").

7.67K lines of code

3 current contributors

over 1 year since last commit

1 user on Open Hub

Very Low Activity
0.0
Tags: etl, gml

talend-bridge-api

Claimed by The OW2 Consortium

Analyzed about 18 hours ago

An API to help developers build Talend Open Studio (TOS) components in an OOP way. It provides accessors, data structures, lists, an ORM layer, Talend type checking, and connectors to help build and debug TOS components, minimizing the number of lines of code that need to be written in JET template files (pain!).

1.22K lines of code

0 current contributors

over 11 years since last commit

1 user on Open Hub

Inactive
0.0
Licenses: No declared licenses

The DataTank

  Analyzed 1 day ago

The DataTank is a project which can turn your open data policy into gold

218K lines of code

0 current contributors

over 5 years since last commit

1 user on Open Hub

Inactive
5.0
Licenses: No declared licenses

Talend Open Studio General Purpose Components Collection

  Analyzed about 17 hours ago

A collection of general-purpose, Maven-driven TOS components with various intended uses, from social network analysis to web service connectors to tweet parsing.

10K lines of code

0 current contributors

over 8 years since last commit

1 user on Open Hub

Inactive
0.0
Licenses: No declared licenses

etl-unit

  Analyzed about 22 hours ago

Unit testing framework for ETL projects

87K lines of code

0 current contributors

6 months since last commit

1 user on Open Hub

Very Low Activity
0.0

etl-agent

  Analyzed 1 day ago

Remote agent for etl-unit

14.9K lines of code

0 current contributors

about 8 years since last commit

1 user on Open Hub

Inactive
0.0
Licenses: No declared licenses

WebServicesHubClient

  Analyzed about 23 hours ago

A client for the Informatica Web Services Hub.

4.06K lines of code

0 current contributors

over 8 years since last commit

1 user on Open Hub

Inactive
0.0
Licenses: No declared licenses