4
I Use This!
Activity Not Available

News

Analyzed over 1 year ago. based on code collected over 1 year ago.
Posted 12 days ago by Leslie Handmaker
The best way to understand something is through concrete examples. I’ve put together seven examples of data pipelines that represent very typical patterns that we see our customers engage in. These are also patterns that are frequently encountered by data… The post 7 Examples of Data Pipelines appeared first on StreamSets.
Posted 14 days ago by Leslie Handmaker
Modern Data Integration Technology Is Helping With the drive to modernize data infrastructure on cloud technologies, one would be forgiven for thinking mainframe systems are destined to go the way of the dodo. The truth is, far from extinction ... [More] , mainframes… The post Mainframe Data Is Critical for Cloud Analytics Success—But Getting to It Isn’t Easy appeared first on StreamSets. [Less]
Posted 15 days ago by Leslie Handmaker
Spark is a widely used platform for businesses today because of its support for a broad range of use cases. Developed in 2009 at U.C. Berkeley, Apache Spark has become a leading big data distributed processing framework for its fast,… The post How to Use Spark for Machine Learning Pipelines (With Examples) appeared first on StreamSets.
Posted 21 days ago by Leslie Handmaker
One crucial part of Big Data is streaming data. As the name suggests, streaming data refers to data that undergoes continuous generation from multiple sources like social media, CRM, and ERM platforms. Handling and analyzing streaming data can be complex,… The post Spark Streaming appeared first on StreamSets.
Posted 28 days ago by Leslie Handmaker
One of the most exciting new capabilities of StreamSets DataOps Platform is its ability to dynamically provision Public Cloud VMs running Data Collector or Transformer for Spark engines.  Public cloud VMs running StreamSets engines can be deployed ... [More] “just in time”… The post Use StreamSets Dynamic Engine Deployment to Reduce Public Cloud Infrastructure Costs appeared first on StreamSets. [Less]
Posted about 1 month ago by Leslie Handmaker
Calling All Data Enthusiasts Today, most companies only have access to data at the tip of the proverbial iceberg. Yet, the greatest value may come from the depths of your systems. Imagine the insights your teams can uncover when you… The post A Roadshow Event Designed To Help You Unlock Your Data appeared first on StreamSets.
Posted about 2 months ago by Leslie Handmaker
Faster data access and easier collaboration among data teams are two key factors that help drive productivity for most data-driven organizations. However, achieving this becomes more complex with the exponential growth of data as business needs grow. ... [More] One way to… The post Data Mesh vs Data Fabric Architectures: What You Should Know appeared first on StreamSets. [Less]
Posted 2 months ago by Leslie Handmaker
Every data-driven enterprise looking to get the most out of its data has a continuous hunt for a cost-effective, scalable, high-performance storage and analytics architectural solution. Organizations have moved from traditional data warehouses to ... [More] data lakes and are now shifting… The post 5 Examples of Cloud Data Lakehouse Management in Action appeared first on StreamSets. [Less]
Posted 2 months ago by Leslie Handmaker
A successful data migration strategy involves moving data from one source to another with as little friction as possible. That friction usually comes in cost, data loss, or downtime accessing the target or destination data sources. A good migration design… The post Create a Successful Data Migration Strategy appeared first on StreamSets.
Posted 2 months ago by Leslie Handmaker
Quality data is key in decision-making. But, demanding big data to be processed in seconds creates a lot of pressure on data engineering systems and can impact the accuracy of the processed data. Large datasets are usually processed by batch… The post Understanding the Lambda Architecture appeared first on StreamSets.