SOCR -- Statistics Online Computational Resource. A very comprehensive collection of online calculators and other interactive resources, including: Distributions (interactive graphs and calculators), Experiments (virtual computer-generated analogs of popular games and processes), Analyses (collection of common web-accessible tools for statistical data analysis), Games (interfaces and simulations to real-life processes), Modeler (tools for distribution, polynomial and spectral model-fitting and simulation), Graphs, Plots and Charts (comprehensive web-based tools for exploratory data analysis), Additional Tools (other statistical tools and resources), SOCR Wiki (collaborative Wiki resource), Educational Materials and Hands-on Activities (varieties of SOCR educational materials), SOCR Statistical Consulting and Statistical Computing Libraries. (added 05/17/2008)
Sampling SIM: Downloadable program (for Mac or Windows) to explore sampling distributions of sample means and proportions. It provides separate windows for building population distributions, drawing and viewing random samples from the population, exploring the behavior of sampling distributions of sample means, and exploring the behavior of confidence intervals. (added 05/17/2008)
STATISTICA, from StatSoft -- a fully functional time limited (30 days) desktop version 8 of STATISTICA Advanced + QC. Contains all modules from Base, Advanced, and QC products: Descriptives, ANOVA, Regression, Nonparametrics, QC Charts, Process Analysis, Design of Experiments, Six Sigma toolbar and calculator, Advanced Linear/Nonlinear Models, Multivariate and Exploratory Techniques, and Power Analysis. (added 03/15/2008)
WinSPC (30-day free trial)-- statistical process control software to:
Gene Shackman's page of links to free software packages. Contains sections for Statistical software, CDC/Census Bureau software, R, Other software, Lists of free stat software, Statistics with Excel, Mapping/GIS software, Non-statistical (but still useful) software, Office Suites (word processors -- stand-alone or web-based), Spreadsheets, Databases, Graphics, Web browsers / FTP clients, SUrvey software, Security software, and Miscellaneous. (added 12/14/2007)
SYSTAT 12 -- powerful statistical software ranging from the most elementary descriptive statistics to very advanced statistical methodology. Novices can work with its friendly and simple menu-dialog; statistically-savvy users can use its intuitive command language. Carry out very comprehensive analysis of univariate and multivariate data based on linear, general linear, and mixed linear models; carry out different types of robust regression analysis when your data are not suitable for conventional multiple regression analysis;compute partial least-squares regression;design experiments, carry out power analysis, do probability calculations on many distributions and fit them to data; perform matrix computations. Provides Time Series, Survival Analysis, Response Surface Optimization, Spatial Statistics, Test Item Analysis, Cluster Analysis, Classification and Regression Trees, Correspondence Analysis, Multidimensional Scaling, Conjoint Analysis, Quality Analysis, Path Analysis, etc. A 30-day evaluation version is available for free download. (added 11/19/2007)
OpenEpi Version 2.2 -- OpenEpi is a free, web-based, open source, operating-system-independent series of programs for use in public health and medicine, providing a number of epidemiologic and statistical tools. Version 2 (4/25/2007) has a new interface that presents results without using pop-up windows, and has better installation methods so that it can be run without an internet connection. Version 2.2 (2007/11/09) lets users run the software in English, French, Spanish, or Italian. (updated 11/19/2007)
Data Shaping Solutions -- Data Mining Job Board / Data Mining Meta Directory / Analytic Job Board. Specializing in Business Intelligence, Statistics, Analytics, Data Management, Data Mining, Data Analysis, SAS Programming, CRM, Artificial Intelligence, Web Mining, Six Sigma, Operations Research, Risk Management, Database Marketing and Quant. Also offers a wide array of expertise in design of experiments, time series, predictive modeling, survey analysis, customer profiling, pattern recognition, statistical testing, data mining across several industries (including Finance, Internet, Marketing and Litigation), programming languages (such as SAS, Java, C, C#, Perl, Splus/R and SQL), and internet solutions such as black box trading. (added 09/22/2007)
Factor -- a comprehensive factor analysis program. Provides univariate and multivariate descriptive statistics of input variables (mean, variance, skewness, kurtosis), Var charts for ordinal variables, dispersion matrices (user defined , covariance, pearson correlation, polychoric correlation matrix with optional Ridge estimates). Uses MAP, PA (Parallel Analysis), and PA - MBS (with marginally bootstrapped samples) to determine the number of factors/components to be retained. Performs the following factor and component analyses: PCA, ULS (with Heywood correction), EML, MRFA, Schmid-Leiman second-order solution, and Factor scores. Rotation methods: Quartimax, ,Varimax , Weighted Varimax, Orthomin , Direct Oblimin, Weighted Oblimin, Promax, Promaj , Promin, and Simplimax. Indices used in the analysis: dispersion matrix tests (determinant, Bartlett's, Kaiser-Meyer-Olkin), goodness of fit: Chi-Square ,non-normed fit index, comparative fit index, goodness of fit index, adjusted GFI, RMS error of approx, and estimated non-centrality parameter (NCP), reliabilities of rotated components , simplicity indices: Bentlers, and loading simplicity index. Provides mean, variance and histogram of fitted and standardized residuals, and automatic detection of large standardized residuals. (added 08/31/2007)
22 Distribution Functions -- There is one spreadsheet for each of the following distribution functions: Beta, Binomial, Chi-Square, Discrete Uniform, Gamma, Geometric, Hypergeometric, Multivariate Hypergeometric, Laplace, Logistic, Multinomial, Negative Binomial, Normal, Bivariate Normal, Log-normal, Pareto, Poisson, Rectangular, Snedecor F, Student-t, Triangular, and Weibull. Each spreadsheet gives a graph of the distribution, along with the value of various parameters, for whatever shape and scale parameters you specify. You can also download a ZIP file containing all 22 spreadsheets. (added 08/16/2007)
SSC-Stat -- an Excel add-in designed to strengthen those areas where the spreadsheet package is already strong, principally in the areas of data management, graphics and descriptive statistics. SSC-Stat is especially useful for datasets in which there are columns indicating different groups. Menu features within SSC-Stat can:
SISA (Simple Interactive Statistical Analysis) -- SISA allows you to do statistical analysis directly on the Internet. Click on one of the procedure names below, fill in the form, click the button, and the analysis will take place on the spot. Study the user friendly guides to statistical procedures to see what procedure is appropriate for your problem. (added 08/13/2007)
Distributions -- Windows program allows for the analysis of discrete single dimension distributions. The program is based on various manipulations of the poisson, binomial and hypergeometric distribution. Available are the probability of an observed number of cases given a certain null hypothesis, the calculation of exact poisson, binomial or hypergeometric confidence intervals, the exact and approximate size of a population using catch-recatch methodologies, the full analysis of a Poisson distributed rate ratio, Fieller analysis, and two versions of the negative binomial distribution can be used in various ways. Beside the exact procedures there are also various approximate procedures available. From the Downloads section of the QuantitativeSkills web site. (added 08/13/2007)
Distributions -- Windows program allows for the analysis of discrete single dimension distributions. The program is based on various manipulations of the poisson, binomial and hypergeometric distribution. Available are the probability of an observed number of cases given a certain null hypothesis, the calculation of exact poisson, binomial or hypergeometric confidence intervals, the exact and approximate size of a population using catch-recatch methodologies, the full analysis of a Poisson distributed rate ratio, Fieller analysis, and two versions of the negative binomial distribution can be used in various ways. Beside the exact procedures there are also various approximate procedures available. From the Downloads section of the QuantitativeSkills web site. (added 08/13/2007)
Multinomial -- This Windows program is the exact solution to the Chi-square Goodness of fit test of testing for a difference between an observed and an expected distribution in a one-dimensional array. For example, the test can be used to compare the distribution of diseases in a certain locality with an expected distribution on the basis of national or international experiences using an ICD classification. In a two-category array the multinomial test provides a two-sided solution for the Binomial test. For example, Multinomial {10 20 0.20 0.80} gives the two-sided probability (0.105) for the single sided Binomial {0.20 10 30} probability (0.061). The multinomial allows you to work with empty '0' observation cells although you must have an expectation about a cell. From the Downloads section of the QuantitativeSkills web site. (added 08/13/2007)
Tables -- a Windows program for the analysis of tables with up to 2*7 and 3*3 cells. The program allows for exact and approximate statistics to be calculated for traditional, ordinal and agreement tables. Fisher exact, Number Needed to Treat, Proportional Reduction in Error Statistics, Normal Approximations, Four different Chi-squares, Gamma, Odds-ratio, t-tests and Kappa are among the many statistical procedures available. From the Downloads section of the QuantitativeSkills web site. (added 08/13/2007)
Lifetable -- does a full abridged current life table analysis to obtain the life expectancy of a population. Furthermore, one can calculate Potential Gains in Life Expectancy (PGLE) after removing cause k, considering competing causes of death; the (Premature) Years of Potential Life Lost (YPLL), this is the number of person years added to the total number of person years lived in a population if cause of death k would be removed; the Standardized Mortality Ratio (SMR), standardized numbers per 100,000 and the Comparative Mortality Figure (CMF) can also be calculated. From the Downloads section of the QuantitativeSkills web site. (added 08/13/2007)
Intracorrelation -- does intra correlation calculations for dichotomous or binary yes/no type outcome variables according to two different methods proposed for the single cluster one by Fleiss and another one by Bennett et.al. A third spreadsheet concerns a method for two clusters by Donner and Klar. You will have to insert your own data by overwriting the tables in the second (total number of positive responses) and third (total number of negative responses) or fourth column (total number). From the Downloads section of the QuantitativeSkills web site. (added 08/13/2007)
A collection of MS-DOS program from the Downloads section of the QuantitativeSkills web site: