Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

Apache Tajo

Compare

Claimed by Apache Software Foundation Analyzed about 3 hours ago

A Distributed Data Warehouse System for Hadoop

257K lines of code

0 current contributors

almost 4 years since last commit

3 users on Open Hub

Inactive
0.0
 
I Use This

infovore

Compare

  Analyzed about 19 hours ago

RDF-Centric Map/Reduce Framework and Freebase data conversion tool

16.6K lines of code

0 current contributors

over 9 years since last commit

2 users on Open Hub

Inactive
5.0
 
I Use This

Apache Sqoop

Compare

Claimed by Apache Software Foundation Analyzed 3 months ago

Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

328K lines of code

10 current contributors

about 3 years since last commit

2 users on Open Hub

Activity Not Available
0.0
 
I Use This

Talend Open Studio General Purpose Components Collection

Compare

  Analyzed about 23 hours ago

A collection of general purpose maven-driven TOS components with various intended uses, from social network analysis to webservice connectors to tweet parsing.

10K lines of code

0 current contributors

over 6 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

Palo ETL Server

Compare

  No analysis available

Palo ETL Server is a Java based Tool for Extraction, Transformation and Loading of mass data into the Palo OLAP Server - This project is integrated into Palo BI Suite and will no longer be updated.

0 lines of code

0 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available
3.0
   
I Use This
Mostly written in language not available
Licenses: No declared licenses

Alfresco ETL Connector

Compare

  Analyzed about 5 hours ago

The ETL Connector extension for Alfresco allows to import documents in an Alfresco repository by using compatible ETL Tools (for now Talend). It also provides an ETL client library that makes it easy to integrate in any ETL tool.

8.98K lines of code

0 current contributors

over 11 years since last commit

1 users on Open Hub

Inactive
5.0
 
I Use This

VIVO Harvester

Compare

  No analysis available

Challenges exist with the source of any data for a scientific profile, be it time constraints of the researcher for hand entry, validity, or provenance. In an effort to reduce the burden of manual data entry, ensure validity, and establish provenance the VIVO team at UF has created a system of tools ... [More] to ingest data from authoritative and aggregate sources in an automated fashion. [Less]

0 lines of code

0 current contributors

0 since last commit

1 users on Open Hub

Activity Not Available
0.0
 
I Use This
Mostly written in language not available
Licenses: BSD-2-Clause

refine-client-py

Compare

  Analyzed about 23 hours ago

The Google Refine Python Client Library provides an interface to communicating with a Google Refine server.

1.41K lines of code

0 current contributors

over 9 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

Stetl - Streaming ETL

Compare

  Analyzed 1 day ago

Stetl provides a toolset for Streaming ETL of geospatial data. Stetl uses existing transformation tools like GDAL/OGR and XSLT and is glued through Python. A config file specifies the ETL chain of modules. Stetl is speed-optimized by using native calls like ogr2ogr, libxml and libxslt (via lxml). ... [More] Stetl is in particularly meant in ETL-cases where either ogr2ogr or XSLT alone is not sufficient. This mostly involves schema transformations as required for INSPIRE and local GML-based complex datasets. Stetl has proven to be able to handle 10's of millions of features/records through a technique dubbed "gml-splitting" (and outputting into a deegree blobstore, which could be dubbed "gml-spitting"). [Less]

7.67K lines of code

3 current contributors

14 days since last commit

1 users on Open Hub

Very Low Activity
0.0
 
I Use This
Tags etl gml

talend-bridge-api

Compare

Claimed by The OW2 Consortium Analyzed 1 day ago

API to help developers to build Talend Open Studio components in a OOP way. This API provide accessors, data structures, lists, an ORM layer, Talend type checking and connectors to help build and debug TOS components minimizing the amount of lines of code that need to be written in JET template files (pain!)

1.22K lines of code

0 current contributors

almost 10 years since last commit

1 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses