Stat-Tree

Stat Tree HELP!

Glossary

Categorical – refers to data in which variable values are mutually exclusive, such as nominal or ordinal.

Confirmatory Factor Analysis – a method for determining the latent variables underlying a theoretically predicted set of observed phenomena (Knoke, 2005). For a more detailed explanation go to this website: Confirmatory Factor Analysis at Science Direct.

Continuous – refers to data in which variable values are numerically related, such as interval or ratio.

Exploratory Factor Analysis – a method for determining what latent variables called factors may underlie an observed phenomenon. For a more detailed explanation go to this website: Exploratory Factor Analysis at the Mailman School of Public Health, Columbia University.

Interval – values are related by sequence and imply equal degree or distance between each value such as on a scale.

Nominal – data that are measured as having presence or absence of some quality and are unrelated numerically.

Ordinal – data that are measured as related in sequence only such as rank ordered.

Path Analysis – a method for determining causal relationships between multiple variables simultaneously using graphs representing strength of relationship (Vogt, 2005). For a more detailed explanation go to this website: Path Analysis at the Mailman School of Public Health, Columbia University.

Ratio – data that are measured as related by sequence, are fractionally possible, and have an absolute zero on a scale.

Scoring – the process of converting raw data from a data collection instrument into a matrix of numbers based on the instrument used to collect the data, the level of data measurement, the type of statistics planned for analyzing the data, and the requirements of the statistical program used for analysis.

Significance criterion – a ratio typically set at .05 in the social sciences which represents the reciprocal of 95% confidence that results of a statistical test are not due to chance, reported in the results as "Sig." or "p-value".

Structural Equation Modelling – a method for determining causal relationships among multiple latent variables using more than one structural equation (Vogt, 2005). For a more detailed explanation go to this website: Structural Equation Modeling in the Directory of Statistical Analyses at IntellectusConsulting.

References

Note: Links to code sources are available in the scripts provided for each test demonstration. Additional sources may be found on the relevant modules in Stat-Tree.

American Psychological Association. (2020). Publication manual of the American Psychological Association (7th ed.). https://doi.org/10.1037/0000165-000

Andrews, F. M., Klem, L., Davidson, T. N., O'Malley, P. M., & Rodgers, W. L. (1981). A guide for selecting statistical techniques for analyzing social science data (2nd ed.). The University of Michigan: Institute for Social Research. https://catalog.nlm.nih.gov/permalink/01NLM_INST/1o1phhn/alma9910274723406676

Babbie, E. (2002). The basics of social research (2nd ed.). Wadsworth.

Bostrom, R. N. (1998). Communication research. Waveland.

Brewer, J., & Hunter, A. (1989). Multimethod research: A synthesis of styles. Sage.

Christensen, L. B., & Stoup, C. M. (1991). Introduction to statistics for the social and behavioral sciences (2nd ed.). Brooks/Cole.

Doornik, J. A., & Hansen, H. (2008). An omnibus test for univariate and multivariate normality. Oxford Bulletin of Economics and Statistics, 70(Supplement). https://doi.org/10.1111/j.1468-0084.2008.00537.x

Frey, L. R., Botan, C. H., & Kreps, G. L. (2000). Investigating communication: An introduction to research methods (2nd ed.). Allyn and Bacon.

Hemphill, J. F. (2003). Interpreting the magnitudes of correlation coefficients. American Psychologist, 58(1), 78-79. https://doi.org/10.1037/0003-066X.58.1.78

Keyton, J. (2018). Communication research: Asking questions, getting answers (5th ed.). McGraw-Hill.

Knapp, T. R. (1978). Canonical correlation analysis: A general parametric significance-testing system. Psychological Bulletin, 85(2), 410-416. https://doi.org/10.1037/0033-2909.85.2.410

Knoke, D. (2005). Structural equation models. In K. Kempf-Leonard (Ed.), Encyclopedia of Social Measurement (pgs. 689-695). Elsevier. https://doi.org/10.1016/B0-12-369398-5/00392-3

Leys, C., Klein, O., Dominicy, Y., & Ley, C. (2018). Detecting multivariate outliers: Use a robust variant of the Mahalanobis distance. Journal of Experimental Social Psychology, 74, 150–156. https://doi.org/10.1016/j.jesp.2017.09.011

Mertler, C.A. & Vannatta, R.A. (2005). Advanced and multivariate statistical methods: Practical application and interpretation (3rd ed.). Pyrczak.

Nimon, K. F. (2012). Statistical assumptions of substantive analyses across the general linear model: A mini-review. Frontiers in Psychology, 3. Article 322. https://doi.org/10.3389/fpsyg.2012.00322

Pearson, K. (1895, June 20). Notes on regression and inheritance in the case of two parents. Proceedings of the Royal Society of London, 58, 240-242.

Pearson, K. (1900). On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine, Series 5, 50(302), 157-175. https://doi.org/10.1080/14786440009463897

Reinard, J. C. (2006). Communication research statistics. Sage.

Salkind, N. J. (2000). Statistics for people who (think they) hate statistics. Sage.

Salkind, N. J. (2010). Omega squared. In N. J. Salkind (Ed.), Encyclopedia of research design (p. 289). Sage. https://doi.org/10.4135/9781412961288.n289

Sommer, B., & Sommer, R. (2002). A practical guide to behavioral research: Tools and techniques (5th ed.). Oxford University.

Student. (1908). The probable error of a mean. Biometrika, 6(1), 1-25. https://doi.org/10.2307/2331554

Tabachnick, B. G., & Fidell, L. S. (2001). Using multivariate statistics (5th ed.). Pearson/Allyn & Bacon.

Verardi, V., & Dehon, C. (2010). Multivariate outlier detection in Stata. The Stata Journal, 10(2), 259-266.

Vogt, W. P. (2005). Dictionary of statistics & methodology: A nontechnical guide for the social sciences (3rd ed.). Sage.

About

Stat Tree^™ began as a project to show students in my undergraduate research methods class how to choose the best statistical test for a given research hypothesis by answering a few simple questions. The first draft was created as a visual flowchart in Microsoft Excel^™ in October 2006. In creating the flowchart, I consulted most directly the flowchart created by Andrews, et al. (1981). However, their guide was created for statisticians and statistics students. What was needed was a simplified decision tree for non-statisticians and students learning research design for the first time. Thus began the development of what would become Stat Tree^™.

For years, a flat-file statistics decision tree was made available to students in my classes in PDF format based on my edits to an original flowchart I built in Excel™. In Fall 2007, I taught my first graduate-level research methods class, and began adding more sophisticated statistical techniques to the flowchart. As students’ research hypotheses diversified, the need to add more techniques, including nonparametric statistics, became necessary.

In Fall 2014, I set about developing my first hybrid online course in research methods. It became apparent that what was needed in this teaching modality was an online interactive statistics decision tree. Instructional Designer, Michael Brand of the University of Texas at San Antonio suggested that I try using a software program designed for making interactive web-based puzzles, Quandary, to create the decision tree. Over the period of the next few months, I used this program to create the infrastructure for the Statistics Decision Tree. I edited the first html prototype using Adobe Dreamweaver™ and other tools and added in content from the research methods courses that I had authored over the years. The original goal making a simplified version for undergraduates had morphed into a need to create two separate interactive Statistics Decisions Trees, one for the undergraduate class and a more advanced version for the graduate class. My projects caught the attention of the UTSA Office of Online Learning, and I was invited to present my project at the Innovations of Online Learning Conference in May 2015.

In Spring 2018, my project caught the attention of the UTSA Office of Commercialization who encouraged me to participate in regional National Science Foundation Innovation Corps (Southwest NSF i-Corps^™) training in Houston, May 2018. I put together a team including Les Doss and David Cortez and travelled to Houston for a 3-week training program. The training was a success and resulted in a recommendation from the Regional i-Corps^™ to submit a proposal to the National Science Foundation, based on the Statistics Decision Tree prototype. That proposal was awarded a $50,000 NSF i-Corps^™ grant, Spring 2019 (Award ID: 1925391). The Stat Tree^™ team was born.

The Stat Tree^™ team travelled to Nashville for national training in customer discovery and commercial development of the Statistics Decision Tree prototype. Over the course of seven weeks, the Stat Tree^™ team travelled all over the country to discover needs of potential users. What was confirmed in these interviews (128 interviews conducted) was that a strong need existed for an interactive Statistics Decision Tree tool outside the classroom in multiple industries, as reported in the peer-reviewed Journal of Strategic Innovation and Sustainability. The most common challenge in these industries was the need to make quick decisions related to statistics for hypothesis testing among non-statisticians. Additionally, interviews revealed that users needed a tool that would demonstrate how to conduct statistical testing using multiple scripting languages used in the most common statistical packages, including R, SAS^™, SPSS^™, and Stata^™.

After i-Corps^™, I continued developing Stat Tree™ as well as related courses I teach at UT San Antonio. In October 2020, my undergraduate research methods course, Conduct of Communication Inquiry, was certified by Quality Matters (QM) as meeting the Quality Matters Higher Education Course Design Rubric Standards in an Official Review. News of this achievement was published in UT San Antonio Today, November 4, 2020. Stat Tree™ development continued as part of my course development.

With version 4.0, Stat Tree™ became a stand-alone tool, privately hosted but available to students in my classes. From this version, the first public release (Version 4.1) sprang forth. Version changes from the beginning of development are listed below.

This latest iteration of Stat Tree^™ provides a statistics decision tree covering over 30 different parametric and non-parametric bivariate and multivariate tests with scripting samples for all tests in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™. Stat Tree^™ also provides demonstrations for several statistical diagnostics, and univariate and multivariate descriptive statistics including normality testing and outliers detection.

Stat Tree™ Version 5.0 was featured in UT San Antonio Today. Watch the StoryTellers Movement podcast on YouTube for a discussion about Stat Tree™ development.

If you use information from this website, please cite as:

LeBlanc, H. P., III. (2025). Title. Stat-Tree.com. https://www.stat-tree.com/Modules/xxx.

For Title, please include the title of the specifically cited page. For the URL, please use the address for the specific page in place of xxx.
Copy citation buttons are provided on all script and video transcript pages in Stat Tree.

H. Paul LeBlanc III, Ph.D., (ORCID iD: 0000-0001-5053-0403, Google Scholar, LinkedIn)
Founder and CEO
Stat Tree^™, LLC

What's New

Stat Tree^™ has added demonstrations for common univariate, bivariate, and multivariate statistical tests in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™, with scripts and outputs. See Tests for a complete list of statistical tests demonstrated in Stat-Tree^™. Also, check out the new and improved Help!

For a demonstration of features, be sure and watch the Stat Tree^™ Welcome and Demonstration video. Transcript for the Stat Tree demonstration video is available here.

Stat-Tree^™ was created by a human using RI (Real Intelligence)!

The code for Stat-Tree^™ and all associated pages was written using the open-source VSCodium.

Version 8.1 (released April 22, 2026)

Full accessibility provided for all demonstration outputs, scripts, and transcripts. No pdf files!
This version completes the design parameters for including demonstrations for all statistical tests across six platforms (Julia, Python^™, R, SAS^®, SPSS^™, and Stata^™), including:
- New demonstrations for two additional tests in Julia (Canonical Correlation and Logistic Regression), and
- New demonstrations for Linear Discriminant Analysis in Julia and Python.

Version 8.0 (released March 6, 2026)

Public release of newly revised: Quantitative Research Methods: A Practical Approach (2nd ed.).
Effect size calculators with user input provided.
Cronbach's alpha Reliability Analysis for Julia, Python, R, SAS, SPSS, and Stata now available in Stat Tree Help!
Provided demonstrations for two additional tests in Python (Canonical Correlation and Logistic Regression).
Ad free video demonstrations now available.
New privacy protections in website user analytics.
Increased accessibility of video transcripts.

Version 7.1 (released November 26, 2025)

Public release of companion ePub: Quantitative Research Methods: A Practical Approach.
Provided demonstrations for four additional tests with effect size calculations in Julia (MANOVA, MANCOVA, Factorial MANOVA, and Factorial MANCOVA).
Revised user interface for better usability on mobile devices.

Version 7.0 (released September 3, 2025)

Revised Stat-Tree statistical test and Stat-Tree Help! scripts now in webpage format with easier to copy code.
Effect size calculations fully implemented for all covered comparative type parametric and nonparametric tests across six platforms: Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™.
Provided demonstrations for three additional tests with effect size calculations in Julia (Cochran's Q, Contingency Analysis, and McNemar's Test).
Added new support links for Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™.
New look: Changed background for more contrast with Stat-Tree and Stat-Tree Help! Modules to increase visibility and accessibility.
Responsive design for better usability on mobile devices.
New footer links to Stat Tree policies on Accessibility, Privacy, use of Cookies, as well as a links to Copyright Notifications.
New Opt Out pop-up for Cookies.
Conformance to WCAG 2.2 Levels A and AA, as well as accessibility standards under Section 508 of the Rehabilitation Act.

Version 6.1 (released July 1, 2025)

Provided demonstrations for two additional tests in Python^™ (Factorial MANCOVA and Factorial MANOVA).
Provided demonstrations for four additional tests in Julia and Python^™ (Factorial ANCOVA and Factorial ANOVA).
Revised scripts and outputs demonstrating effect size calculations for fourteen different parametric and nonparametric tests across five platforms (Python^™, R, SAS^™, SPSS^™, and Stata^™) and ten different parametric and nonparametric tests in Julia.

Version 6.0 (released March 14, 2025)

Provided a statistical tests index direct access link.
Provided links to advanced tests from the basic tests.
Provided demonstrations for an additional five tests in Python^™ (Cochran’s Q, Contingency Analysis with the Cochran-Mantel-Haenszel test, McNemar, MANCOVA, and MANOVA).
Provided demonstrations for an additional five tests in Julia and Python^™ (ANCOVA, Repeated Measures ANOVA, Cramer’s V, Somers’ d, and Friedman).
Provided demonstrations for Kendall’s tau-b in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™.
Revised scripts for Julia, Python^™, and R.
Provided post-hoc pairwise comparison tests for ANOVA and Repeated Measures ANOVA in Python^™, R, SAS^™, SPSS^™, and Stata^™.
Provided a sample hypothesis with each test.
Revised Help! section includes:
- Added a priori Power Analysis sample size estimation in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™.
- Added data Winsorization demonstrations in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™.
- Added Installing Packages demonstration in Stata^™.
- Restructured Help! section for clarity.

Version 5.1 (released December 10, 2024)

Provided demonstrations for an additional six tests in Julia and Python^™ (Mann-Whitney U, Kruskal-Wallis H, Wilcoxon Signed-rank, Spearman and Biserial Correlation, and Multiple Linear Regression).
Provided demonstrations for univariate descriptive statistics (in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™) including:
- Categorical descriptives and data visualization
- Ordinal descriptives and data visualization
Expanded demonstrations for univariate descriptive statistics in Excel^™ including copyable formulas.
Provided demonstrations for multivariate descriptive statistics in Julia and Python^™ including:
- Multivariate normality test
- Multivariate outlier detection tests
Revised Help! section includes:
- Reshaping data from long to wide format (in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™)
- Creating new environments (in Julia, Python^™, and R)

Version 5.0 (released September 1, 2024)

Provided demonstrations for the original six tests (Chi-Square, Paired and Independent Samples t-Tests, One-way ANOVA, Pearson Correlation, and Simple Linear Regression) in Julia and Python^™.
Provided demonstration for univariate descriptive statistics in Julia and Python^™ including:

Measures of central tendency: Means and Standard Deviations.
Skewness and Kurtosis.
Scatterplots and Histograms.
Univariate Normality and Outlier Detection tests.

Included demonstrations (in Julia, Python^™, R, SAS^™, SPSS^™, and Stata^™) for Levene’s Test for Homogeneity of Variance under Descriptive Statistics: Assumption Testing.
Revised help section to include:
- Working with command prompts for terminal-based programming in Julia, Python^™, and R.
- Importing data in Julia and Python^™.
- Installing packages in Julia, Python^™, and R.
- Restarting the command prompt kernel in Julia, Python^™, and R.

Version 4.1 (August 15, 2023), First public release.

Fixed broken links revealed following posting to online server.
Posted to online server.

Version 4.0 (January 2023)

Provided demonstrations for all tests in R, SAS^™, SPSS^™, and Stata^™.
Added branches for descriptions of Exploratory and Confirmatory Factor Analysis, Path Analysis, and Structural Equation Modelling.

Version 3.0 (May 2019), Revised prototype for NSF i-Corps^™.

The demonstration video presented at i-Corps is available here. Transcript for the i-Corps demonstration video is available here.

Provided demonstrations in SPSS^™ for:
- Canonical Correlation,
- Discriminant Analysis, and
- Logistic Regression.

Version 2.0 (2016), Revised Statistics Decision Tree for graduate students.

Provided demonstrations in SPSS^™ for factorial, multivariate, and Repeated Measures ANOVA.
Provided demonstrations in SPSS^™ for multiple Regression.
Provided demonstrations in SPSS^™ for nonparametric equivalents of presented parametric tests.

Version 1.0 (2014), First iteration of the Statistics Decision Tree for undergraduate students.

Provided demonstrations in SPSS^™ for Chi-Square, Paired and Independent Samples t-Tests, One-way ANOVA, Pearson Correlation, and Simple Linear Regression.
Provided demonstrations of univariate descriptive statistics and visualizations in SPSS^™.

Stat-Tree

Stat Tree HELP!

Glossary

References

Links

About

What's New

Contact Stat-Tree™

Contact Stat-Tree^™