VMware Tanzu Data Solutions Reviews and Pricing

AANKITGUPTAA

Consultant at Pi DATACENTERS

Nov 15, 2022

Download

Open-source, reliable, and able to expand

Pros and Cons

"The solution is stable."

"It doesn't have any GUI-based monitoring tools."

What is our primary use case?

Postgres is a database. Like Oracle is a database, this is a database.

We replaced our Oracle paid database with the open-source Postgres database, and we migrated around 50 lakhs of consumer data there with different rows and tables. We deployed this in different production staging and testing. We created three deployments, and each deployment has three servers.

How has it helped my organization?

Earlier, we were using the Oracle paid database, which is a commercial product. Then we switched to open-source due to the fact that we have lots of new projects and we could not handle the licensing costs. So we migrated our data from the Oracle Database Server to the Postgres Server. It helped us to evaluate the cost. We saved lots of money in terms of licensing.

What is most valuable?

It is open-source.

It provides the database load-balancing capability itself. It has data, like Pgpool, an open-source database, and a load balancer also.

We can also create the cluster in between the database in active, standby mode.

The solution is stable.

It's scalable.

What needs improvement?

It doesn't have any GUI-based monitoring tools. Oracle has some proprietary tools for monitoring all the databases and all that. Postgres doesn't have any graphical capabilities where we can monitor the database. We have to do it with the Sierra stuff and run some random commands. Then we can get the data from the cluster and databases table.

The initial setup is complex.

It would be ideal if they could provide an active cluster in Postgres. If one primary DB goes down, it should automatically fail over to the second database.

Buyer's Guide

VMware Tanzu Data Solutions

March 2025

Free Report: VMware Tanzu Data Solutions Reviews and More

Learn what your peers think about VMware Tanzu Data Solutions. Get advice and tips from experienced pros sharing their opinions. Updated: March 2025.

DOWNLOAD NOW

848,716 professionals have used our research since 2012.

For how long have I used the solution?

We've been using it for four years. I've used it since around 2018.

What do I think about the stability of the solution?

It's stable. There are no bugs or glitches. It doesn't crash or freeze. It's reliable.

What do I think about the scalability of the solution?

It is very scalable.

From the user's perspective, we have around 500 users. However, we have around 50 lakhs of consumer records in the solution.

We plan to increase usage. We already added the sponsors, and we require the capacity and the transactional processing. Therefore, it's scalable. We don't have any licensing restrictions, so we can add on as required.

How are customer service and support?

While we don't have technical support, we do have creative support. They are quite good.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

We used to use Oracle Database.

How was the initial setup?

We required lots of planning during the initial setup. The migration phase is very complex when you move from Oracle to Postgres. The installation and configuration have a moderate amount of difficulty.

The deployment and maintenance require three people, including one system administrator and two database administrators.

What about the implementation team?

We handled the initial setup in-house. We didn't need any outside assistance.

What was our ROI?

We haven't invested any money into the solution and therefore haven't looked into ROI.

What's my experience with pricing, setup cost, and licensing?

We have zero licensing costs. The solution is open-source.

Which other solutions did I evaluate?

We also evaluated some parts of MySQL. However, we didn't find it very suitable and scalable.

What other advice do I have?

You need to be clear about your use cases and the transactional requirement you are observing from the database architecture before beginning with this solution. You need to consider your architecture based on the scalability and reliability of the applications. You need to take this into account before deploying any solution to Postgres.

I'd rate the solution eight out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Nasir Niamat

Principal Software Engineer - Db Ops at i2c Inc.

Jul 5, 2023

Download

An open-source solution with a good loading speed, but maintenance is time-consuming

Pros and Cons

"The loading speed is very good."

"Maintenance is time-consuming."

What is our primary use case?

We are using the product for analytical purposes like reporting and billing.

How has it helped my organization?

We maintain the servers on our premises. Compared to Snowflake, Greenplum is a cheap solution for analytical purposes.

What is most valuable?

The latest version is better than the older ones. The solution updates very fast. The loading speed is very good.

What needs improvement?

Maintenance is time-consuming. It takes time to VACUUM and ANALYZE the tables to remove the fragmentations.

For how long have I used the solution?

I have been using the solution for five years.

What do I think about the stability of the solution?

The solution is stable.

What do I think about the scalability of the solution?

Compared to Snowflake, Greenplum is not scalable. The solutions used on premises are not scalable compared to the cloud solutions. Around 200 to 300 people use the product in our organization.

How are customer service and support?

Support is fine. We do not use high-level support. The support team is quite supportive.

How was the initial setup?

The setup is easy. It is not complex.

What about the implementation team?

We must set up the instance and run scripts to deploy the product. It is very simple. We can deploy the scripts with one or two commands. One person is enough to deploy the solution.

What's my experience with pricing, setup cost, and licensing?

It’s an open-source solution. There are no expenses for using it.

What other advice do I have?

We are using the latest version of the solution. Some of our clients asked us why we were not using Snowflake, so we are evaluating Snowflake as per their request. If we replace Greenplum with Snowflake, the purpose would be to minimize maintenance time and enhance scalability. If someone is looking for a cheap solution, Greenplum is a good choice for them. Overall, I rate the product a six out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Buyer's Guide

VMware Tanzu Data Solutions

March 2025

Free Report: VMware Tanzu Data Solutions Reviews and More

Learn what your peers think about VMware Tanzu Data Solutions. Get advice and tips from experienced pros sharing their opinions. Updated: March 2025.

DOWNLOAD NOW

848,716 professionals have used our research since 2012.

Abdullah Jan Farooqui

Director Consulting Services at M3tech

Jul 17, 2023

Download

Uses a memory desk processor very efficiently and performs well while maintaining a low cost

Pros and Cons

"The solution's best feature is its exceptional speed, delivering efficient utilization of resources."

"The support feature could benefit from some improvement in terms of accessibility and responsiveness."

What is our primary use case?

We specifically use the solution for queuing purposes, and it has proven to be fantastic in that aspect.

How has it helped my organization?

What is most valuable?

The solution's best feature is its exceptional speed, delivering efficient utilization of resources. It uses a memory desk processor very efficiently. It offers high performance while maintaining a low cost.

What needs improvement?

The solution is a fine product. However, to make it perfect, in some cases, there might be a need to traverse the queue. RabbitMQ currently lacks the capability for archiving the queue, which essentially turns it into a log.

For such requirements, you may need to explore other options like Kafka or custom drivers that allow traversing the entire queue. In RabbitMQ, while you can traverse the entire queue, you need to devise a workaround to handle the messages. For example, you can read a message from one queue, publish it to another queue or keep it in some other way to retain the desired entries, and then stop at that point.

Additionally, the need for support may vary depending on the usage and potential heavy loads on the system. The support feature could benefit from some improvement in terms of accessibility and responsiveness.

I don't encounter significant challenges or areas that require improvement while using the solution. Everything works smoothly, and I find it well thought out. It's got excellent compliance with MQP 9.0. Overall, I have had a positive experience with the solution.

For how long have I used the solution?

I have been using the solution since 2017.

What do I think about the stability of the solution?

The solution is highly stable. As an example, at this moment, I am in front of my admin panel and can confirm that it has been running continuously for the past 173 days.

What do I think about the scalability of the solution?

The solution is scalable, although I still need to utilize the clustering option. A single server is sufficient and efficiently handles most of our workloads. It effectively uses system resources such as memory, CPU, and disks, resulting in excellent performance with minimal resource usage.

How are customer service and support?

So far, we have not needed any support from the solution's official support team or community. We rely on Google search and our team's research, leveraging various online resources to explore and implement solutions independently.

Which solution did I use previously and why did I switch?

When I joined my current company, I initially explored Apache Kafka, but I realized that Kafka is primarily a log system rather than a queuing system. I encountered limitations with Kafka, such as maintaining pointers for each process and manually removing messages from the queue.

Comparatively, RabbitMQ proved to be more convenient as it automatically deletes messages from the queue when using auto or manual acknowledgment. Considering these factors, we switched from Kafka to this solution due to its efficiency.

How was the initial setup?

The solution's installation process was straightforward, especially if you have good skills in installing software and a good command of Linux. Once the Bandit software is downloaded and extracted, the installation is completed.

After that, accessing the admin interface allows for a user-friendly GUI experience. The deployment process took around half an hour.

We have a private cloud infrastructure using VMware, which means our servers are running on-premises and are owned by our company. We have a limited number of servers running the solution.

Specifically, we have one primary server and one secondary server without implementing clustering. Replicating these two servers is sufficient for our workload, and they can be installed by a single system administrator in just half an hour without any issues, provided they have DPU-installed Linux available.

Overall, I would rate the setup experience as nine out of ten.

What's my experience with pricing, setup cost, and licensing?

The solution's pricing is cost-effective as it does not involve significant expenses. Licensing is required only for the server, while clients do not need any licensing. Therefore, it proves to be a cost-efficient option.

Which other solutions did I evaluate?

In my previous organization, we heavily relied on Tibco messaging solutions like Tibco RD (Rendezvous) and Tibco RV (Rendezvous) for the entire rating system. I have also explored Apache Kafka.

What other advice do I have?

If you are looking for a queuing system for your application that guarantees insured delivery and ensures single delivery without duplicates, RabbitMQ is the right solution as it provides all these capabilities with ease of use.

With RabbitMQ, your application doesn't need to worry about receiving duplicate messages as the solution handles that internally, ensuring that each message goes through a single process for one delivery.

I highly recommend the solution and would rate it an eight out of ten.

Which deployment model are you using for this solution?

Private Cloud

Disclosure: I am a real user, and this review is based on my own experience and opinions.

it_user264192

Senior Enterprise Technical Architect at a computer software company with 10,001+ employees

Aug 24, 2017

Download

We experience performance of approximately 1TB per hour loading data to Greenplum without the use of specialized hardware.

Pros and Cons

"Scalable (Massive) Parallel Processing (MPP) – The ability to bring to bear large amounts of compute against large data sets with Greenplum and the EMC DCA has proven itself to be very effective."

"We would like to see Greenplum maintain a closer relationship with and parity to features implemented in PostgreSQL."

What is most valuable?

Of particular value to our environment and applications are the following Greenplum capabilities:

Scalable (Massive) Parallel Processing (MPP) – The ability to bring to bear large amounts of compute against large data sets with Greenplum and the EMC DCA has proven itself to be very effective.
Fast load of data into Greenplum – We experience performance of approximately 1TB per hour loading data to Greenplum without the use of specialized hardware.
MADlib (madlib.net) – There are a number of statistical and analytical functions available within MADlib upon which we rely. Among these are linear regression, logistic regression, apriori, k-means, principle component analysis, etc.
User Defined Functions in Python (UDFs in PL/Python) – Where MADlib does not provide a direct solution to an application problem, the ability to quickly prototype and deploy user defined functions with Python has been effective.

What needs improvement?

We would like to see Greenplum maintain a closer relationship with and parity to features implemented in PostgreSQL. The current version of Greenplum is based on a fork of PostgreSQL v8.2.15. This edition of PostgreSQL was EOL by the PostgreSQL project on Dec 2011. The current version of PostgreSQL is v9.5.

For how long have I used the solution?

We began production use in November, 2011. Alongside Greenplum, we're also using EMC Data Computing Appliance v2.3.3 (8/10), of which we have two and a half racks in production, and one and a quarter racks in dev/tests.

What was my experience with deployment of the solution?

We had no issues with the deployment.

What do I think about the stability of the solution?

The only issues with stability we’ve experience have been the sporadic fail over of primary to mirror segments. The environment continues to operate in this instance with the failure of queries that were in flight at the time of the fail-over.

What do I think about the scalability of the solution?

We have had no issues with scalability whatsoever.

How are customer service and technical support?

The service and support we’ve received from both Pivotal and EMC has been exemplary. The exceptions to this would be:

The EMC Request for Product Qualification (RPQ) process – EMC DCA support is contingent upon EMC approval of all third party software installed onto a DCA. There have been times that this approval has taken as long as 60 days to process.
Root Cause Analysis of Greenplum Database Incidents – When Greenplum Database incidents have occurred (e.g. primary database segments failing over to their backup), and Pivotal has been called for support, the response has been near immediate (30 minutes or less). Additionally, the incident resolution provided has been equally expedient. Where this has caused some disappointment is the response to our request for a root cause of the incident. These requests tend to queue up and we don’t seem to get answers beyond the typical vendor response of “that’s been fixed in the next release”.

Which solution did I use previously and why did I switch?

The purchase of Greenplum was our first interaction with Pivotal. We have been a customer of EMC for a very long time.

What other advice do I have?

My primary reason for reducing points on this rating is due to the fact that Greenplum is based on a fork of PostgreSQL v8.2.15 (EOL by the PostgreSQL project on Dec 2011). The current version of PostgreSQL is v9.5. There are a number of current PostgreSQL features of which we would like to take advantage (JSON support, materialized views, full text search, XML support, column-based permissions, row-based permissions, etc.).

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Gyula Bereczky

Senior Data Engineer at a financial services firm with 10,001+ employees

Oct 18, 2020

Download

Powerful external data integration and parallel load capabilities, with good technical support

Pros and Cons

"The parallel load features mean that Greenplum is capable of high-volume data loading in parallel to all of the cluster segments, which is really valuable."

"The initial setup is somewhat complex and the out-of-the-box configuration requires optimization."

What is our primary use case?

Greenplum is a distributed database that we used for data warehousing.

What is most valuable?

The parallel load features mean that Greenplum is capable of high-volume data loading in parallel to all of the cluster segments, which is really valuable.

The service management capabilities are good.

The external data integration with Parquet, Avro, CSV, and unstructured JSON works well.

It has an advanced query optimizer.

What needs improvement?

The initial setup is somewhat complex and the out-of-the-box configuration requires optimization.

- OS settings need to be tuned according to the Install guide.

- Only group/spread mirroring by gpinistsystem, block mirroring is manual (Best Practices Guide)

- Db maintenance scripts are not supplied - some of them added in cloud - need to be implemented based on the Admin Guide.

- Comes with two query optimizers, PQO is default, some queries perform better with the legacy planner, it needs to be set.

For how long have I used the solution?

We have been working with Greenplum for about five years.

What do I think about the stability of the solution?

Greenplum is pretty stable.

What do I think about the scalability of the solution?

This product is absolutely scalable. We have more than 400 users in our database.

How are customer service and technical support?

The technical support is exquisite.

This is a company that really listens to its customers. I am very happy with our relationship.

Which solution did I use previously and why did I switch?

Before I joined this company, I used different data warehousing solutions.

Making the transition to Greenplum requires a completely different mindset because it is massively parallel. It's more like a Big Data mindset, where you need to consider that you are distributing data between cluster nodes. It is not always straightforward to make the switch.

How was the initial setup?

The initial setup is kind of complex. You need an expert to set up a Greenplum cluster.

It may not be possible to simplify the initial setup because there's an out of the box configuration and you can use it. I've actually seen companies using it for years and it works, but it didn't work optimally so they were not happy with the results.

You can set up Greenplum but you really need to read the manual and the installation guide. I've seen people skipping it and then complaining.

What about the implementation team?

A few people are enough to maintain this product. If you want to have around the clock support then you will need a couple of people in different time zones, but generally, maintenance is straightforward.

What other advice do I have?

We are currently in the process of upgrading from version 5.26 to 6.11 and I can already see a lot of improvements. I can't wait to try them. According to the roadmap, there are a lot of new improvements coming in the V7 version, which is due out next year.

My advice for anybody who is implementing Greenplum is that they really need an expert to assist them. They might hire consultants or grow experts in-house, although that takes time and it is not always straightforward. You can use Greenplum out of the box but to really leverage all of the capabilities, you definitely need to tune your system and also design your database objects.

When people think about a database they usually think about Oracle, Mircosoft SQL, or maybe MySQL. Greenplum is a distributed database that needs a completely different mindset. I think that when people start to use it, they don't really understand. For example, you cannot switch from Oracle to Hadoop because you will need the same change, but when they switch to Greenplum from Oracle, or just put data from Oracle to Greenplum, they don't consider this change as seriously as they would for Hadoop.

Overall, I am very happy with this product.

I would rate this solution a nine out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

PraveenKumar28

Packaged App development Senior Analyst at a consultancy with 10,001+ employees

Mar 19, 2024

Download

A cloud solution for asynchronous call with easy configuration

Pros and Cons

"The most valuable feature is asynchronous calls, which are easy to configure."

"We needed to configure additional plugins. While it was relatively easy to do this on-premises, it became more challenging in the cloud."

What is our primary use case?

We use the solution for the asynchronous call and POPS mechanism.

What is most valuable?

The most valuable feature is asynchronous calls, which are easy to configure.

What needs improvement?

We needed to configure additional plugins. While it was relatively easy to do this on-premises, it became more challenging in the cloud.

For how long have I used the solution?

I have been using VMware RabbitMQ for one year.

What do I think about the stability of the solution?

The product is stable. We haven't faced any issues.

What do I think about the scalability of the solution?

VMware RabbitMQ needed to be a more scalable product. It wouldn't perform consistently if you wanted to add workload or users or reduce workload. We faced problems with it during heavy loads. The cloud version is scalable. We can scale it up or down based on our requirements, such as the number of users or workload.

Around seven or eight people were in every group, and many teams were using it for virtual use.

How are customer service and support?

The infrastructure was handled by a person responsible for configuration and related tasks. I primarily focused on configuring connections as a developer. I could handle it by installing components like plugins. When issues arose in the cloud environment, we escalated them to the support channels.

How was the initial setup?

We installed VMware RabbitMQ on a local computer. We are currently using Docker and Kubernetes for deployment in our local environment. It was relatively easy to deploy compared to an on-premise system.

The solution can be a bit challenging to handle. Not every configuration and deployment works seamlessly. It depends on the project team and compatibility, but is relatively easy to use.

What's my experience with pricing, setup cost, and licensing?

The product is free of cost.

What other advice do I have?

We configure the monitoring and alerting for the RabbitMQ setup. We transfer the message to the designated queue if there are any errors or similar issues. We use a Spring Boot application and microservices for this purpose, making it easy to route the message.

If you want to use this solution, you first need to understand the concept of exchange queues. Certain clusters require specific knowledge. The configuration may vary depending on the application type. For instance, the configuration was relatively straightforward in our case with microservices. We only needed to provide authentication and the correct URL. If it ran on a cloud environment, we would provide the instance, username, and password, and the configuration would be handled automatically. It would depend on the language and the specific type of microservice or application for more advanced customization, such as writing code.

The solution is easy to use, configure, and install.

Overall, I rate the solution an eight or nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Michael Grayson

Director of IT Operations at a financial services firm with 10,001+ employees

Feb 27, 2023

Download

Its most valuable feature is the ability to quickly run through data, but they should improve product performance

Pros and Cons

"It works very well with large database queries."

"If you have a user consuming a huge load of resources, it takes down the entire system."

What is our primary use case?

My team uses Greenplum for automating a lot of job writing scripts using PSQL. We use it to automate reports. Also, I use the identity workbench for logging in. In addition, I use DBeaver as well.

How has it helped my organization?

It works very well with large database queries. It works efficiently even in cases where you have a query worth terabytes of data.

What is most valuable?

Its most valuable feature is the ability to quickly run through data. It has a good processing speed as well.

What needs improvement?

They should improve product performance. It does not share resources well whenever it has more than one process running. If you have a user consuming a huge load size of resources, it takes down the entire system.

They should work on resource pulling and sharing those resources properly. At present, it allows one user to completely take over everything.

For how long have I used the solution?

We have been using the solution for eight or nine years.

What do I think about the scalability of the solution?

I would rate the solution about seven or eight out of ten for its scalability. Around 30-40 people in my organization are using the solution. We are not looking to increase usage.

You eventually have to get new hardware because the CPU stops quickly once the load increases. It's only limited to hardware. You don’t realize you need new hardware until you have to because it doesn’t have throttles.

Which solution did I use previously and why did I switch?

We have used Snowflake, Oracle, and many such solutions.

How was the initial setup?

The setup seems pretty straightforward.

What other advice do I have?

The solution has its limitations. It works better with a serial-threaded program than a multi-threaded program. If you go for the solution knowing that, you should be fine. I would rate the solution as a six out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Teodor Muraru

Developer at Emag

Jan 13, 2022

Download

A good tool that's simple to use and is great for messaging

Pros and Cons

"Companies can scale the solution, so long as they have server room."

"The user interface could be improved."

What is our primary use case?

We primarily use the solution for consumers and publishers. It's for messaging and consumer publishing. That's it.

What is most valuable?

The solution is simple to use.

It's great for messaging and consumer publishing.

Companies can scale the solution, so long as they have server room.

The stability is good.

What needs improvement?

The user interface could be improved. We have an interface that shows the consumption rate, the number of consumers, their occupation rate. We should have a column in that interface that shows the estimated time until, at the current rate of consumption, the number of messages is to be consumed from a specific queue. That would be great. I wanted to read, however, as it is right now, JavaScript would have loaded the browser too much. Basically, I'd just like to see the consumption rate in each queue without too much fuss.

The solution could use some plugins that could be integrated into the server installation. We had a plugin that we used to delay something that from one version to the other was integrated into the server setup. Maybe it was more of an extension. However, more plugins could be also be integrated into newer versions of Rabbit.

For how long have I used the solution?

I've used the solution since 2013 or 2014. It's been about eight years at this point.

What do I think about the stability of the solution?

The solution is usually stable. We have problems with space on the Rabbit servers. When they are full, we might lose everything. That's a big no-no. This is a problem for Kafka as well, however, we have higher thresholds in that area. Rabbit is the poor brother to Kafka, so it receives less space. That's why, sometimes, in some departments, this problem occurs.

What do I think about the scalability of the solution?

The solution can scale, however, we use a lot of space for Kafka. We have clusters through the servers, and there may be more for each department. If some needs appear, we can increase the number of servers in a cluster to better manage messages. As long as your company can increase the number of servers, it can scale.

We have about 100 departments that use this solution in some way.

In our case, we have in our department five people and we have two clusters with Rabbit for two different directions. For us, it's enough. We do not plan to increase usage.

How are customer service and support?

I've never directly contacted technical support. We use recommendations on the site, which is very good. I appreciate the recommendations, however, I'm not sure about the maintenance of the documentation from one version of Rabbit to the other. The older versions of the documentation might be less accurate.

Which solution did I use previously and why did I switch?

Other departments might use, for example, Kafka, however, I'm unsure as I have no visibility on them.

How was the initial setup?

I was there when the solution was initially implemented and, from what I recall, it took half a year.

It was completely new. No one knew anything about it. However, we knew that we had to do something to improve the communication between departments. It was a good solution. That said, it took a long time before everyone understood how it works.

We had a few dedicated people who liked the idea of Rabbit and implemented it. It took a while for the rest of the company to get behind them and learn how to do it.

There are one or two people at any given time available to handle any type of maintenance responsibilities.

What about the implementation team?

We handled the implementation process ourselves.

What other advice do I have?

We're using a few different versions. It depends on the department. Some departments have the latest, some don't, some use a very old version. I'm using 3.8. We do have plans to make an upgrade.

It was a few years ago now when I learned this process of separating publishers versus consumers in terms of messages and communicating between departments. This was the biggest game changer for myself. I'd advise new users study that aspect and understand it.

I'd rate the solution at an eight out of ten. It's a very good tool and we use it all the time.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.