Percentile: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Peter Schmitt
(→‎Definition: defining k)
imported>Peter Schmitt
(→‎Definition: some additional explanation)
Line 18: Line 18:
             P(\omega\ge x) \ge 1-{k\over100}   
             P(\omega\ge x) \ge 1-{k\over100}   
             \quad\quad ( k \in \mathbb N , 0 < k < 100 ) </math>
             \quad\quad ( k \in \mathbb N , 0 < k < 100 ) </math>
In this definition, ''P'' is a probability distribution on the real numbers.
It may be obtained either
* from a (theoretical) probability measure (such as the [[normal distribution|normal]] or [[Poisson distribution]], or
* from a finite population where it expresses the probability of a random element to have the property,<br>i.e., it is the relative frequency of elements with this property (number of elements with the property divided by the size of the population),or
* from a sample of size ''N'' where it also is the relative frequency which is used to estimate the corresponding percentile for the population from which it was taken.


== Special cases ==
== Special cases ==

Revision as of 18:21, 1 December 2009

This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
This editable Main Article is under development and subject to a disclaimer.

Percentiles are statistical parameters which describe the distribution of a (real) value in a population (or a sample). Roughly speaking, the k-th percentile separates the smallest k percent of values from the largest (100-k) percent.

Special percentiles are the median (50th percentile), the quartiles (25th and 75th percentile), and the deciles (the k-th decile is the (10k)-th percentile). Percentiles are special cases of quantiles: The k-th percentile is the same as the (k/100)-quantile.

Definition

The value x is k-th percentile if

In this definition, P is a probability distribution on the real numbers. It may be obtained either

  • from a (theoretical) probability measure (such as the normal or Poisson distribution, or
  • from a finite population where it expresses the probability of a random element to have the property,
    i.e., it is the relative frequency of elements with this property (number of elements with the property divided by the size of the population),or
  • from a sample of size N where it also is the relative frequency which is used to estimate the corresponding percentile for the population from which it was taken.

Special cases

For a continuous distribution (like the normal distribution) the k-th percentile x is uniquely determined by

In the general case (e.g., for discrete distributions, or for finite samples) it may happen that the separating value has positive probability:

or that there are two distinct values for which equality holds such that

Then every value in the (closed) intervall between the smallest and the largest such value

is a k-th percentiles.

Examples

The following examples illustrate this:

  • Take a sample of 101 values, ordered according to their size:
.
Then the unique k-th percentile is .
  • If there are only 100 values
.
Then any value between and is a k-th percentile.

Example from the praxis:
Educational institutions (i.e. universities, schools...) frequently report admission test scores in terms of percentiles. For instance, assume that a candidate obtained 85 on her verbal test. The question is: How did this student compared to all other students? If she is told that her score correspond to the 80th percentile, we know that approximately 80% of the other candidates scored lower than she and that approximately 20% of the students had a higher score than she had.