Tech Tips

Quickly Audit Data in IBM SPSS Modeler

To improve your experience using IBM SPSS Modeler, the Version 1 SPSS experts have created various Tech Tips. This Tech Tip shows how to quickly audit data in IBM SPSS Modeler. 

IBM SPSS Modeler is an extensive predictive analytics platform designed to bring predictive intelligence to decisions made by individuals, groups, systems, and the enterprise. Modeler has an easy-to-use drag-and-drop user interface with a complete set of tools for accessing data, data examination, preparation, modelling, evaluation, and deployment. 

IBM SPSS Modeler users have a complete toolset to build predictive models from start to finish. Modeler uses node-based, visual programming. Users pick nodes from palettes and place them on the stream canvas. Once nodes have been placed on the stream canvas and edited, they can be linked to form a stream. A stream represents a flow of data through several operations (nodes) to a destination that can be in the form of output (either text or chart), a model, or the export of data to another format (e.g., a database). 

Obtaining a complete overview of all data takes time and effort. Modeler has the data audit node, which provides a comprehensive view of all fields in the data. The Data Audit node shows each field with thumbnail graphs, statistics, field measurements, outliers, extreme cases, and missing data. 

To quickly audit data in Modeler, go to the Output palette. Select the Data Audit node and drag it onto the stream canvas. You can also double-click the node to drop it onto the stream canvas. Once it is on the canvas, you can connect it to your stream. Double-click to open the node. We could click the custom fields button to audit a subset of the data. Select Graphs and Basic and Advanced Statistics.

There is also the option to calculate the median and mode. The Audit tab lists each field and shows a thumbnail graph and statistics. Users can click on a thumbnail graph to see distributions or histograms. The Quality tab shows data types, outliers, extremes, and missing data. Users can sort on % Complete to see which fields have missing data. Users can also click on the column top and use the arrows to sort fields. 

The Data Audit is a fast overview of your data in just a few clicks. Users can also examine data by a target field using the Overlay option in the Data Audit settings. 

Tools Covered

IBM SPSS Modeler

Related Solutions

Training

Tagged As IBM SPSS Modeler

Need some help?

Image of three women working on laptops at a table for Version 1 SPSS Training

Learn how to use SPSS from the experts

With more than 20 years of delivering highly successful training programs, Version 1 offers a wide range of training options to best suit your requirements, enabling you to optimise your IBM SPSS Software, achieve your analytical goals and continually improve your results.

Related Tech Tips

Our SPSS experts have created a range of Tech Tips for IBM SPSS Modeler. Take a look through.

Arrange a free consultation to discuss your analytical needs and identify the best solution for you.