span8

span4

span8

span4

- Home /

Article created with
FME Desktop 2017.1

The power of FME is being able to take data from multiple sources and manipulate it efficiently. So why not use FME for data science?

We’ve recently added a series of transformers to the FME hub that performs a few basic statistical tests using the RCaller or the PythonCaller.

If you don't see the statistical test you are looking for in this list, you can create your own and upload it to the FME Hub to share with other users or create a new Idea and if it gets enough votes will add it to the list.

Learn how to create a custom transformer using either R or Python to perform the Shapiro-Wilks test (to test for the normality of a distribution). This workflow can be adapted for any statistical test using R or Python.

Each transformer listed has a link to the FME Hub page as well as a test workspace download. Due to the external software requirements for R, these test workspaces could not be uploaded to the hub. Each of the R transformers requires R to be installed on the users' machine as well as the sqldf R package. For the Python transformers, the SciPy Python package needs to be installed.

A correlation is a test between two variables to determine their association.

Uses R to calculate if there is an association between two variables.

RCorrelation-TestWorkspace.fmwt

A Cluster Analysis is a method for determining groups of data.

Uses R to calculate similar groups of data using one of three algorithms. *This transformer only works for 2018.0+*

RClusterCalculator-TestWorkspace.fmwt

The Shapiro-Wilks test calculates whether a random sample of data comes from a normal distribution.

Using R and the RCaller this transformer calculates whether a random sample of data comes from a normal distribution using the Shapiro-Wilks test.

RShapiroWilks-TestWorkspace.fmwt

Using Scipy and the PythonCaller, this transformer calculates whether a random sample of data comes from a normal distribution using the Shapiro-Wilks test.

PyShapiroWilks-TestWorkspace.fmwt

A T-Test is a statistical test to test if the means of two samples are significantly different from random.

The one-sample t-test tests the null hypothesis that the population mean is equal to a specified value, In other words, it tells you if the mean of your sample is close enough to a certain number to be statistically significant. This test outputs the t-value, p-value, confidence interval and the estimate.

ROneSampleTTest-TestWorkspace.fmwt

The two-sample t-test tests the mean of two groups to determine if they are significantly different or it is by random chance. This test outputs the t-value, p-value, confidence interval and the estimate.

thub.nodes.view.add-new-comment

rcorrelation-testworkspace.fmwt
(7.7 kB)

ronettest-testworkspace.fmwt
(7.6 kB)

rtwottest-testworkspace.fmwt
(7.7 kB)

rshapirowilks-testworkspace.fmwt
(12.6 kB)

pyshapirowilks-testworkspace.fmwt
(12.4 kB)

clustercalculator-testworkspace.fmwt
(1.6 MB)

Perform a Shapiro-Wilk Statistical Test using R or Python

Using Python to deliver data to an external FTP site (2012 only)

Example Workflow using FME, Python and Oracle

Publishing an Item to the FME Hub

Pyfme cannot be called from the command line with Data Interoperability

Using a Python Startup/Shutdown Script or PythonCaller to Perform Geoprocessing with Arcpy

Calculating a Running Total or Cumulative Sum | StatisticsCalculator

© 2019 Safe Software Inc | Legal

- Anonymous
- Sign in
- Create
- New Question
- New Article
- New Idea
- Spaces
- 3D (and BIM)
- Attribute Handling
- CAD
- Cloud
- Coordinate Systems
- Custom Transformers
- Database
- Dynamic Workspaces
- FME Cloud API
- FME Cloud Administration
- FME Cloud Getting Started
- FME Desktop 3rd Party Integrations
- FME Desktop Administration
- FME Desktop Administration & Configuration
- FME Desktop Development
- FME Desktop Getting Started
- FME Desktop Installation
- FME Desktop Licensing
- FME Desktop Plug-In SDK
- FME Desktop Workbench Scripting
- FME Server 3rd Party Integrations
- FME Server Administration
- FME Server Administration & Configuration
- FME Server Development
- FME Server Getting Started
- FME Server Installation
- FME Server Licensing
- Fanouts
- Ideas FME Cloud
- Ideas FME Desktop: Data Inspector
- Ideas FME Desktop: Formats & Systems
- Ideas FME Desktop: Transformers
- Ideas FME Desktop: Workbench
- Ideas FME Server
- KML
- Lists
- Performance Tuning
- Point Cloud
- Published Parameters
- Raster
- Real-Time
- Running Multiple Workspaces
- Tabular
- Troubleshooting Techniques
- Vector / GIS
- Web
- Workflow Design
- XML / GML
- Zip Files
- *FME Desktop
- *FME Server
- *FME Cloud
- *Other
- Explore
- Topics
- Questions
- Articles
- Ideas
- Users
- Badges