StreamSets is a data integration platform that enables organizations to efficiently move and process data across various systems. It offers a user-friendly interface for designing, deploying, and managing data pipelines, allowing users to easily connect to various data sources and destinations. StreamSets also provides real-time monitoring and alerting capabilities, ensuring that data is flowing smoothly and any issues are quickly addressed.
Type | Title | Date | |
---|---|---|---|
Category | Data Integration | Dec 26, 2024 | Download |
Product | Reviews, tips, and advice from real users | Dec 26, 2024 | Download |
Comparison | StreamSets vs Azure Data Factory | Dec 26, 2024 | Download |
Comparison | StreamSets vs Informatica PowerCenter | Dec 26, 2024 | Download |
Comparison | StreamSets vs Informatica Intelligent Data Management Cloud (IDMC) | Dec 26, 2024 | Download |
Title | Rating | Mindshare | Recommending | |
---|---|---|---|---|
Informatica Intelligent Data Management Cloud (IDMC) | 4.0 | N/A | 93% | 181 interviewsAdd to research |
Azure Data Factory | 4.0 | 11.0% | 92% | 87 interviewsAdd to research |
StreamSets has a user-friendly interface which makes it easy to implement batch, streaming, or ETL pipelines and integrate with various platforms such as Snowflake, AWS, Google Cloud, and Azure. Its bifurcation feature is valuable as it allows for a good bifurcation rate with fewer mistakes. The data pipelines have a good design and are easy to implement without requiring technical skills. It is a containerized application, easy to use with Docker and Kubernetes. StreamSets has a variety of components and features that are useful for planning, executing, and monitoring pipelines. It is a no-code or low-code platform, which makes it easy to configure data sources and data output for easy configuration of data pipelines. It has an efficient scheduling system and a wide range of connectors for connecting to any data source. StreamSets also provides meaningful insights into data analytics platforms and has a powerful built-in Transformer feature. The Data Collector and Control Hub platforms are valuable to users, and the pipelines feature enables users to pull in and push out data from different sources and manipulate and clean things up within them. StreamSets is a powerful, modern data analytics solution that integrates a large volume of data from different sources and is easy to implement with good training material and user manuals.
StreamSets has room for improvement in several areas, including the complexity of the Transformer for Snowflake, the need for better documentation and explanation of nodes, a lack of user-friendliness in the interface and monitoring visualization, and difficulty in setting up nodes and pipelines. Users also desire better error logging and more features for reporting and organizing pipelines. Some reviewers suggest improvements to the loading mechanism, logging system, and ability to manually manipulate data. Additionally, there are concerns about the cost and the steep learning curve, as well as the need for more knowledge-based content and a better, more detailed version history. Some users also note issues with data processing speed and security.
StreamSets has provided a significant return on investment for many users. Users have reported savings in time and money, with some reporting up to 40% ROI. The solution has helped increase efficiency, revenue, and sales, with some reporting up to a 50% increase. It has also reduced errors, improved accuracy, and simplified the data ingestion and integration process. The solution has replaced the need for manual labor, resulting in significant cost savings. The solution has also provided security and safety, supporting various heterogeneous sources, resulting in increased profits. The time saved from using StreamSets has allowed users to focus on other tasks and increase productivity. Overall, users have reported a positive ROI from using StreamSets.
Opinions on the pricing and licensing for StreamSets are mixed. Some users find it affordable and cost-effective, while others consider it expensive, particularly for small businesses. Some suggest the pricing should be more flexible, depending on the intended use of the software. There are different versions and plans available, including a free trial and open-source options. The licensing cost varies from customer to customer, and some users feel the pricing could be improved for small and mid-size organizations.
StreamSets is primarily used for data integration and pipeline creation, with a focus on healthcare and banking industries. It allows for the sharing of data between platforms and the creation of pipelines that load data from various sources to the cloud or other destinations. It also enables real-time streaming and data transformation, and can be customized to suit specific needs and budgets. The tool is used by IT departments and data engineering teams for projects such as creating data lakes, delivering continuous data for database operations, and generating patterns and insights for data analysts and scientists. It is also used as a no-code option for machine learning integration.
StreamSets' customer service and support have mixed reviews. Some users rate their technical support as very supportive, knowledgeable, and responsive, while others complain about long wait times for responses to queries and the need for improvement in customer care services. Some users appreciate the effort StreamSets puts into resolving issues, such as providing customized patches, while others suggest they need more dedicated technicians with professional knowledge. Overall, StreamSets' customer service and support receive a rating between six and ten out of ten, with some room for improvement.
The initial setup for StreamSets varies depending on technical expertise and the number of data sources involved. Some found it easy and straightforward, while others found it complex and required assistance from the support team. The deployment time ranged from three days to one month, and maintenance requirements were minimal. It was generally agreed that the documentation was helpful, and the cloud-based platform made implementation easier. Some encountered issues with network and firewall, but overall, the implementation process was simple and lean.
StreamSets is generally considered to be a scalable solution that can be used by small to medium-sized enterprises with a relatively small number of users to larger organizations with thousands of concurrent users. It integrates well with major cloud platforms and offers a good way to store and process data. Users report that it can be easily scaled up or down depending on the volume and velocity of data being processed, but note that the cost of scalability can be a factor. Some users report that the underlying hardware can impact scalability, but overall StreamSets is viewed as a highly scalable solution that can be used across multiple departments and locations.
StreamSets is a highly stable solution with very little downtime reported by users. The stability has improved over time and is now considered a solid 10 out of 10. While there have been some latency and server speed issues, overall the stability of the cloud-based solution is highly rated. Some users have experienced minor issues, but they were quickly resolved by the support team.