Try our new research platform with insights from 80,000+ expert users
Founder at pt. sma
Real User
Has many existing algorithms that we can use but it should have the ability to create higher-level presentations
Pros and Cons
  • "They have many existing algorithms that we can use and use effectively to analyze and understand how to put our data to work to improve what we do."
  • "The product should provide more ways to import data and export results that are user-friendly for high-level executives."

What is our primary use case?

In our company, we have about 2000 workers. I use K-means algorithms all the time. So, for example, say we want to classify workers to do better studies by groupings. Maybe we want to target groups of workers who tend to make more errors so we can concentrate on reducing the effects of this issue. We can further classify the grouping in terms of age and in terms of year of services in the company or other ways that may be helpful. After we have found the grouping of people, we can do other types of analysis and the result of the study can be used to apply solutions. In this case, where we are looking to reduce errors, it might be a solution to deploy more supervision and evaluate more of the variables from the group of the employee that we have isolated to analyze performance issues. For example, we may want to reduce excessive overtime to enhance alertness. We are use TFS (Team Foundation Server) and we use the K-means algorithm to help optimize the behaviors in the groups that we identify.  

I think in the future this kind of algorithm will be used widely by people to predict many things and to improve the performance of businesses and individuals.  

What is most valuable?

I like that the solution provides many built-in options for us. They have many existing algorithms that we can use. Using them effectively depends on us because we have to understand how to utilize any particular solution. But actually, the product is at a good level when it comes to ease-of-use. You have to have the data first to work with. You retrieve data you want to use from the database and you format the data and put it to the SPSS. Then you do analysis and from that, you can take better steps towards a business solution based on the results the analysis returns.  

What needs improvement?

The areas of SPSS Statics that have room for improvement — where it can be simplified to make it better — have to do with how you connect the SPSS solution with the data. They could provide enhancements like a query console so we can connect to the database and feed data more easily into SPSS. That could be really beneficial and save time and effort. Just enhancing the tools to collect the data would be significant.  

One other enhancement I would like to see is the ability to create higher-level presentations. When you are finished processing the data, you come up with the result. If you are a technician or a data analyst it is not so hard to look at the data. But what I think would be a nice addition is to create presentations. The bosses at a higher level do not want to see the raw data and to show the data is not so nice and easy for them to understand. Some ways to do more to create a presentation — something that is easier to look at and understand — would be good. The ability to do this should be improved.  

They need to improve the ability to connect to the databases, provide a query console and query windows. The engine for processing is there already, but the data feed has to be better. Then provide ways to present the result differently. Maybe graphic presentation options would be good so understanding the results can be more intuitive for the user.  

For how long have I used the solution?

We have been using the IBM SPSS solution for about a year.  

Buyer's Guide
IBM SPSS Statistics
November 2024
Learn what your peers think about IBM SPSS Statistics. Get advice and tips from experienced pros sharing their opinions. Updated: November 2024.
816,406 professionals have used our research since 2012.

What do I think about the stability of the solution?

The stability of SPSS Statistics doesn't seem to have bugs, glitches, or crashes. It's pretty stable and it functions well under the loads we have tested. The number of SPSS users is currently not more than 10 people. The data that we process with the SPSS is never really more than 2000 rows. So we only know that it is stable at this level of usage so far. If there is an opportunity to process 100,000 rows, I don't know what the performance would be. From what I know we would not see any problems, but so far our usage is limited.  

What do I think about the scalability of the solution?

While we don't really know yet what the performance will be like under greater loads for big data, I am pretty confident that the product is very scalable.  

How are customer service and support?

We have had contact with the IBM SPSS technical support team through the phone and they help us through what they call the TeamViewer. They can assist us promptly and I think the service is good. We are satisfied with the level of technical support we receive.  

Which solution did I use previously and why did I switch?

We did not use a different solution because we are trying to stay with Oracle and IBM solutions for compatibility and dependability.  

How was the initial setup?

The initial setup of the product is easy.  

What about the implementation team?

We did not do the deployment ourselves, we had someone do it. Even though it was not me who did the deployment, I know how they did it and was involved with the group who did it. So I am experienced with the initial setup but asked for assistance from integrators to not divert internal resources and be sure it was done right.  

What other advice do I have?

My advice to other users is that if they are not doing proper data analysis that there are other, better solutions like SPSS to predict and to analyze the data. Assuming the data is already there, they may just be printing a report or pulling a PDF or filling information into the database. But they are not really doing what they can with the data to use it.  

Now there are better solutions we can use and should use to produce information from the data in a way we did not before. It is a better way and the way of the future for data analysis. We need to use predictive algorithms and AI to get more from the information we collect and use it to create better solutions for our businesses. Data analysis is different now than it was only maybe 5 years ago. We can perfect the data and come up with facts from that analysis that we can use for predictions that will help make looking into the future with prediction more viable and accurate.  

On a scale from one to ten where one is the worst and ten is the best, I would rate IBM SPSS Statistics as a 7 out of 10. To improve that score they need to enhance the data access and export so that it will be less work, add dashboards for queries and give more options for reporting output.  

Which deployment model are you using for this solution?

On-premises
Disclosure: My company has a business relationship with this vendor other than being a customer:
PeerSpot user
it_user7365 - PeerSpot reviewer
Other non-IT at a non-profit with 501-1,000 employees
Vendor
Review of IBM SPSS Text Analytics for Surveys

Coding qualitative data is a time-consuming and often costly aspect of the survey process. In this post, I will provide feedback on IBM SPSS Text Analytics for Surveys (STAS), which is software designed specifically for coding survey data. The benefits of using this software include improved efficiency and consistency when coding text.

STAS allows you to set up a customizable coding algorithm to code your data. You can create your own coding scheme, or you can use one that is already built in. The built-in coding mechanism can identify responses that mention a place, a name, a positive or negative opinion, an opinion of how affordable an item is, and a number of other types of responses. One benefit of STAS is that it recognizes synonyms and alternative versions of words, making the task of setting up a coding algorithm more efficient.

I recently used STAS for two very different projects and found that it worked great for some purposes and not so great for others. If you are considering STAS, you may want to answer the following questions to inform your decision.

1.      How much data do I have?

STAS is optimized for coding 1,000 responses or fewer, each about 40 characters long. I used it to code about 5,000 open-ended survey responses at a time and it still worked reasonably well.

On the other hand, I also used STAS to code over 100,000 Tweets for research into whether Twitter data can supplement survey data. I set up the coding system (STAS calls this a “Text Analysis Package”) with a subset of Tweets, then I used it to code 25,000 Tweets at a time. Initially, I tried coding all 100,000+ Tweets simultaneously, but STAS could not handle it, so I resigned to coding 25,000 at a time and waiting an hour or two while the process was running. See here and here.

2.      Will I be coding more data?

Depending on how complicated your dataset is, setting up the coding system can take a lot of time. You’ll want to consider whether it is worth the initial time investment to set up the coding system, because for a low volume of complicated data, manual coding may be more efficient. For longitudinal studies especially, it might be worth that initial investment of time because once the system is set up you will be able to code subsequent waves of data with minimal effort.

3.      How much will the responses vary?

I found that STAS worked best for coding demographic data because the range of responses was limited. It worked reasonably well for coding types of injuries (also somewhat limited), but it did not work as well for coding relationships between people. These responses ranged from simple responses such as “mother,” “friend,” or “teacher,” to complicated responses that were typically unrepeated in the dataset. The complicated responses were along the lines of “my best friend’s boyfriend’s step-mom’s boss.” I’m sure STAS can be set up to accurately code these sorts of responses, but I had a hard time figuring it out (see #5 below) and quite frankly, it’s probably more efficient to manually code these responses.

4.      Would I be relying on STAS for sentiment analysis?

STAS is capable of coding sentiment, but I would test it carefully to see how it works with your data. As part of our Twitter research, we manually coded a random sample of 500 Tweets in our dataset and found that STAS sentiment coding was in agreement with manual coding only 44% of the time. STAS would likely perform better with survey data than Tweets, which often use unconventional language, but I would still recommend proceeding with caution if you plan to use STAS for sentiment analysis.

5.      Am I able to take a course on STAS?

It took several days of working with test data and poring over the user’s manual (which I was not impressed with) for me to really figure out what to do with STAS. I know enough about STAS to get by, but I have also come to realize just how much I don’t know about STAS. I encourage you to take a STAS course if you are able. Learning new software is usually easy for me, but without any training, I really struggled with STAS.

If you decide to proceed with STAS for coding your data, here’s one tip as you get started. Run your data through spell check (e.g. in Word or Excel) before importing into STAS. STAS catches many spelling errors, but not all. Anything you can correct will speed up the coding process.

Do you have additional STAS tips to share? Has your experience with STAS been similar to or different than mine? Would you recommend something besides STAS for coding?

Please comment below!

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user1068 - PeerSpot reviewer
it_user1068Tech Support Staff at a tech company with 51-200 employees
Vendor

Course on STAS - I was totally new in using this program back in 2008. I took a course that lasted approximately three months. Although this SPSS program seemed somehow difficult at first, it was not in the long run. For those without knowledge of this software can take up a course. However, I believe it is a program that anyone who is determined can learn how to use on their own without spending a dime on training materials. Thank you Ashley for sharing with us these extensive tips on STAS.

Buyer's Guide
IBM SPSS Statistics
November 2024
Learn what your peers think about IBM SPSS Statistics. Get advice and tips from experienced pros sharing their opinions. Updated: November 2024.
816,406 professionals have used our research since 2012.
Consultora asociada especialista en Ciencia at a consultancy with 11-50 employees
Real User
Very flexible with a short learning curve
Pros and Cons
  • "The most valuable features are the small learning curve and its ability to hold a lot of data."
  • "I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."

What is our primary use case?

My primary use cases are statistic analysis and data governance.

What is most valuable?

The most valuable features are the small learning curve and its ability to hold a lot of data.

What needs improvement?

I would like SPSS to be integrated with Cloud Pack For Data

For how long have I used the solution?

I've been using this solution for fourty years.

What do I think about the stability of the solution?

This product is stable.

What do I think about the scalability of the solution?

This solution's scalability is good.

How was the initial setup?

The initial setup was very simple and took only a day or two.

What's my experience with pricing, setup cost, and licensing?

Licenses are available on a yearly basis. There are no additional costs, everything is standard.

What other advice do I have?

This is a very flexible product that can be used for numerous purposes. It's also very easy to teach others to use it. I would rate this solution as ten out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
PeerSpot user
Senior Research Analyst at a university
Vendor
​Custom tables and macros allow us to create useful reports quickly for a broad audience
Pros and Cons
  • "Custom tables and macros: They allow us to create useful reports quickly for a broad audience."
  • "Better documentation on how to use macros."

What is most valuable?

Custom tables and macros: They allow us to create useful reports quickly for a broad audience.

How has it helped my organization?

We can provide internal benchmarks that allow specific units to compare themselves to others and the whole organization.

What needs improvement?

  • Scripts are clunky to work with.
  • Better documentation on how to use macros.
  • More affordable training for new staff members.

For how long have I used the solution?

I have used it for 17 years.

What do I think about the stability of the solution?

Some of the versions have issues. It is not always made public that a “fix” is available. Many hours can be wasted thinking you have something wrong in your code when it really is an issue with the newest version.

What do I think about the scalability of the solution?

I have used SPSS with 100,000 records. At that point, it was reaching its limit. Processes would not run, or would run incompletely. I am told for very large files, SAS may be better, but I have not confirmed this.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user7362 - PeerSpot reviewer
Other non-IT at a aerospace/defense firm with 51-200 employees
Vendor
IBM SPSS Text Analytics - Any Good? Yes.

I recently discovered that IBM bought SPSS a few years ago and is now providing a Text Analytics package called IBM SPSS Text Analytics for Surveys (producing the acronym STAS, which is either a stats package or an STD). I thought I'd take it out for a test drive so I downloaded a 14 day trail version. Before using, I reviewed these excellent tutorials: Analytics Blog and RTI's SurveyPost blog.

For data I could have used their sample data, but I decided to download my Twitter archive. Unfortunately, this caused me some pre-processing hassle. You see, STAS is technically designed for survey data and it expects unstructured language to be in the form of comment responses to questions and it expects those comments to be stored in single cells within a column in a spreadsheet (I see no reason in principle why it couldn't used for analyzing any unstructured data. You just have to package the data in a format SPSS will accept, namely a spreadsheet with the unstructured data all in one column).

Also, STAS does not directly ingests CSV files or Open Office ODC files. Apparently it only accepts inputs of four types: its own file type, Excel, ODBC, and what they call “Data Collection” which I haven't investigated.

Once you open a file, you are asked to drag-and-drop the column name containing your language data into an "Open Ended Text" box (refer to Analytics Blog for screen shots). While I appreciate the simplicity of the drag and drop functionality, my Twitter data had tokens separated into separate columns (which I thought was weird. Let me do my own tokenization, please!). STAS' functional choice means I needed to pre-process my data files. I had to merge the many token columns containing language tokens into a single column. Document pre-processing is common in language analysis, but STAS is supposed to be a platform easy to use for non-engineers. These file ingest and pre-processing steps are tedious and uninteresting and exactly why most people get frustrated. These things can be automated and it is a platform like STAS that ought to be doing this for me.

Also, it seems to only ingest a single file at a time. My Twitter data came to me separated by month so I have 38 files. I can manually merge them, but more work for me. Really no reason STAS can't let me select multiple files all formatted identically, then merge if necessary.

I was surprised and impressed that the software immediately offered me an opportunity to translation non-English comments with a single click. Simple and easy. Quality is what it is with MT. Don't blame STAS if it's a crappy translation. No matter how you slice it, it's a great function. Kudos.

I was super impressed that it will crawl the data and suggest code categories like key concepts. This is essentially topic modeling (though not as sophisticated as something like LDA. The User Guide has a whole chapter devoted to describing the details, but I haven't had time to dig in yet). Color coded clusters of concepts is a very nice function. Colors seem to refer to entity types (Person,. Org, etc). You can collapse all concepts into just the key exemplars of each cluster. There are also several nice filtering options to help you understand what your data is centered around. Here's a screenshot of my final output:

I can see key concept frequencies and filter by that. That's nice. Next steps: Can I see simple word frequencies? Ngrams?

Sentiment analysis can be done with respect to specific categories (food + positive). Pretty easy, but SPSS should mitigate lay people's over-indulgence in sentiment analysis which is tricky and not as easy as this makes it looks. This is where making something easy backfires. How can STAS encourage double checking the data? Gold Standards, sampling, etc.

No doubt, this is easy to use. An academic has the luxury of ignoring people who don't want to learn command line tools or programming languages, but the businessman does not. There's a ton of language data out there owned by thousands of companies and those companies are never going to get their regular employees to learn R just to analyze it. For them, STAS is a legitimate tool that will actually allow the average employee to dig into unstructured data. That's a win.

*In the interest of full disclosure: I do not work for IBM and this is not a sponsored blog in any way. These thoughts are entirely my own. I once worked for IBM briefly over 5 years ago and I still get the occasional IBM recruiter contacting me about opportunities, but this is my personal blog and all content is my own and reflects my honest, personal opinions.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer1506921 - PeerSpot reviewer
Advanced analytics at a manufacturing company with 10,001+ employees
Real User
Straightforward to set up, stable, and it is quick to build statistical models
Pros and Cons
  • "You can quickly build models because it does the work for you."
  • "The technical support should be improved."

What is our primary use case?

Once it is installed it does an important job because it helps with long statistical analysis processes.

What is most valuable?

You can quickly build models because it does the work for you.

What needs improvement?

The technical support should be improved.

For how long have I used the solution?

I have been using IBM SPSS Statistics for one year.

What do I think about the stability of the solution?

This is a stable product.

What do I think about the scalability of the solution?

This product is scalable and there are three versions. There is a smaller version, a mid-level, and there is a full enterprise version. The licensing can make it difficult to deploy models although it should be possible with the enterprise version. However, I have never tried it.

How are customer service and technical support?

The technical support is not very useful. You can find help on the internet but if you really have a problem then it can take a while before you get a solution.

How was the initial setup?

The setup is pretty straightforward and I think that you can easily set up your own connection.

What about the implementation team?

My client performed the setup.

What other advice do I have?

Overall, I like this product and I don't have any major things that I would like to change.

I would rate this solution an eight out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Lead Consultant at Caligo
Consultant
Good machine learning algorithms and statistical models, but technical support needs to be more responsive
Pros and Cons
  • "Some of the most valuable features that we are using with some business models are machine learning algorithms, statistical models given to us by the business, and getting data from the database or text files."
  • "Technical support needs some improvement, as they do not respond as quickly as we would like."

What is our primary use case?

The primary use case of this solution is for collecting business data and reporting to upper management. Also, I am creating some regulatory and ad-hoc reports.

What is most valuable?

Some of the most valuable features that we are using with some business models are machine learning algorithms, statistical models given to us by the business, and getting data from the database or text files.

The current features meet with the needs of our company. Our needs are not complex for the features offered.

What needs improvement?

There are some scheduling issues that created some complex flows. Creating a simple scheduling mechanism would be an improvement for this tool.

Technical support needs some improvement, as they do not respond as quickly as we would like.

The price is too expensive so they should consider reducing the cost, make it more competitive.

For how long have I used the solution?

I have been using this solution for three years.

What do I think about the stability of the solution?

This solution is stable, we have not had any issues.

What do I think about the scalability of the solution?

We have not scaled this solution as it is sufficient for us.

We have one hundred people using this solution. Most are IT, consultants.

How are customer service and technical support?

We have contacted technical support two or three times and they don't respond quickly.

How was the initial setup?

The initial setup is straightforward.

What about the implementation team?

We did not use a vendor team to implement this solution. We did it ourselves.

What's my experience with pricing, setup cost, and licensing?

We think that IBM SPSS is expensive for this function.

What other advice do I have?

In this field, there are some open-source products such as KNIME that may be useful for our purpose, because it lessens the costs.

I would rate this solution a seven out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
it_user6549 - PeerSpot reviewer
BI Expert at a university with 501-1,000 employees
Vendor
Predictive modeling and auto-model are valuable but IBM does not handle our support queries very well

For how long have you used this product?
- Three years

Which features of this product are most valuable to you?
- Predictive modeling, and its best feature is auto-model, it will give you options on which models are best suited to your data such as CHAID, c5 or Logistic REgression, it also displays accuracy, another good feature is that it has boost and reduce data for replication (else you will have to rely on sample power) for power analysis

Can you give an example of how this product has improved the way your organization functions?
- Yes, we predicted enrollment of our students more accurately

What areas of this product have room for improvement?
- pricing maybe

Did you encounter any issues with deployment, stability or scalability?
- It's definitely scalable since its integratable with SQL Server or a flat database file like excel

Did you previously use a different solution and if so, why did you switch?
- We haven't used any since Predictive analytics software is very expensive

Before choosing this product, did you evaluate other options? If so, which ones?
- We used this product this it was parallel with SPSS data sets of which most psychology and stat backgrounds use since its very usable in terms of user interface, its not much based on syntax unlike other software available.

How would you rate the level of customer service and technical support?
- Customer service and technical support is given by STRAND asia, however I would say IBM is not that good in handling our queries.

Was the initial setup straightforward or complex? In what ways?
- It became complex since IBM bought SPSS 4 years ago.

Did you implement through a vendor team or an in-house one? If through a vendor team, how would you rate their level of expertise?
- Through a vendor team

What is your ROI on this product?
- No ROI involved

What advice would you give to others looking into implementing this product?
- IBM SPSS modeler is good to integrate with IBM Cognos, make sure though that you have your data warehouse set up properly, there is also R and Python integration, you can download R essentials (32 bit and 64 bit) to work with R nuggets, definitely a big plus (imagine open source and the power of IBM to bring in a powerful and yet flexible software)

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user