Possible alternatives if your data violate rank sum test assumptions

 

Google

Web This Site

Featured Article: Where to start...

Documents for quality control plans, internal audit plans, ISO 9001, ISO 9001:2000, management systems, mil-i-45208, QC manuals, quality control manuals, quality control systems, quality management systems, total quality management

Home
 
Construction
Manuals
Procedures
FAA
Forms
Kits
ISO 9001
Quality Plan

Helpful Links
 
Basics
Copyright
Link Directory
Link Exchange
Privacy
Resources
SPC Definitions
Stat Guide
Where-to-Start

Non-Tech Links
 
Cool Tool
Frustration
Inspiration
Opinion
Nonsense
Serious


Sponsored Links
 

 

Possible alternatives if your data violate rank sum test assumptions


If the data for one or both of the samples to be analyzed by a rank sum test come from a population whose distribution violates the assumption of same distributional shape, or if outliers are present, then the rank sum test on the original data may provide misleading results, or may not be the most powerful test available. Transforming the data to promote normality and then performing a two-sample t test, or using another nonparametric test may provide a better analysis.

Alternative procedures:


Transformations:
Transformations (a single function applied to each data value) can be applied to correct problems of unequal dispersions. Transforming the two samples to remedy nonnormality often results in correcting heteroscedasticity (unequal dispersions). If such a transformation can be found, the transformed data may be suitable for use with a two-sample t test. The resulting test may be more powerful than the original rank sum test. The same transformation should be applied to both samples.

Unless scientific theory suggests a specific transformation a priori, transformations are usually chosen from the "power family" of transformations, where each value is replaced by x**p, where p is an integer or half-integer, usually one of:

  • -2 (reciprocal square)
  • -1 (reciprocal)
  • -0.5 (reciprocal square root)
  • 0 (log transformation)
  • 0.5 (square root)
  • 1 (leaving the data untransformed)
  • 2 (square)

For p = -0.5 (reciprocal square root), 0, or 0.5 (square root), the data values must all be positive. To use these transformations when there are negative and positive values, a constant can be added to all the data values such that the smallest is greater than 0 (say, such that the smallest value is 1). (If all the data values are negative, the data can instead be multiplied by -1, but note that in this situation, data suggesting skewness to the right would now become data suggesting skewness to the left.) To preserve the order of the original data in the transformed data, if the value of p is negative, the transformed data are multiplied by -1.0; e.g., for p = -1, the data are transformed as x --> -1.0/x. Taking logs or square roots tends to "pull in" values greater than 1 relative to values less than 1, which is useful in correcting skewness to the right. Transformation involves changing the metric in which the data are analyzed, which may make interpretation of the results difficult if the transformation is complicated. If you are unfamiliar with transformations, you may wish to consult a statistician before proceeding.

Other nonparametric tests:
Although the rank sum test is the most commonly used nonparametric alternative to the unpaired two-sample t test, it is not the only one. However, all tests assume that the two samples are independent of each other, and that there is independence within each sample.

A median test can be calculated by creating a 2x2 contingency table of counts of the values in each sample that are greater or not greater than the median of both samples together. Then this contingency table can be tested by a chi-square test or Fisher's exact test. This test does not assume equality of dispersions, but is likely to be less powerful than the rank sum test when the dispersions are in fact comparable.

Unpaired two-sample t test
If the sampled values do indeed come from populations with normal distributions, then the unpaired two-sample t test is the most powerful test of the equality of the two means, meaning that no other test is more likely to detect an actual difference between the two means. (If a distribution is symmetric, its mean and median are both equal to the center of symmetry. Since the normal distribution is symmetric, the t test can also be viewed as testing whether the difference between the two sample medians is 0, if the normality assumption holds.) If the population distributions are not normal, however, the rank sum test may be more powerful at detecting differences between the sample medians.

If applying a transformation promotes normality, the unpaired two-sample t test may be a more powerful test than the rank sum test for the transformed data.


Examine the glossary.

Back to StatGuide home page.

Get Adobe to read our PDF evaluation docs.
 

Satisfaction Guaranteed
If you cannot edit At-PQC™ document(s) with your MS Office, OpenOffice or compatible cloud software program, we will fix it or refund your purchase.

Can't find what you're looking for...?
Please call, Fax or Email Us at:

Cell-Phone Support 6:00am to 6:00pm.

Home Page

Click here to bookmark At-PQC™ then visit our Toolbox to find a quality control plan that will help you achieve an effective and efficient business infrastructure that focuses on customer satisfaction, continuous improvement and desirable cost savings. Visit with us today for comprehensive assistance in developing or choosing the right quality control plan for your business. Click here to visit our extensive selection of quality control plans, policies, procedures and forms or click here for help with where-to-start.

 

We can interact with you anywhere in the USA from 6:00am to 6:00pm Monday through Friday except holidays.

JnF Specialties, LLC
664 Greenscape Lane
Colorado Springs, Colorado 80916

Cellphone Support 6:00am to 6:00pm.

Email Us at:

Send an email to request support or call our helpline during your office hours.

Click here to let us know how we're doing.

Get Adobe | About Us | Site Map | Contact Us | Privacy Policy
Policies | Procedures | FAA | Forms | Kits | ISO 9001 | Quality Plan

( quality control plan . com )