Introduction to Biostatistics: Scope, Variable & Data

Objectives of this article

To understand

  • What is statistics
  • What is biostatistics
  • Scopes of statistics
  • What is variable
  • What is random variable
  • Types of variables
  • What is qualitative or discrete variable
  • What is quantitative or continuous variable
  • Practical examples of qualitative or quantitative variables


Statistics is defined as a body of processes for creating reasonable and wise decisions in the face of uncertainty. These are applied in the analysis of numerical data of various aspects including interpretation of data on the basis of certain statistical principles.

Statistics is a field of study involved with techniques or methods of collection of data, classification, summarizing, interpretation, drawing, inferences, testing of hypothesis, making recommendations etc. only when a part of data is used.

Biostatistics or Biometry

Biostatistics is the term referred when tools of statistics are applied to the data that is derived from biological sciences. In other words when the principles of statistics are applied to study of organisms or living system, the study is called biostatistics or biometry

It encompasses the design of biological experiments, especially in medicine, pharmacy, agriculture and fishery; the collection, summarization, and analysis of data from those experiments; and the interpretation of, and inference from the obtained results.

Scope of Statistics

1. In Physiology and Anatomy

  1. To define what is normal or healthy in a population and to find limits of normality in variables such as weight & pulse rate.
  2. To find the difference between means and proportions of normal at two places or in different periods. The mean height of Bangladeshi boys is less than the mean height of American boys . Whether this difference is due to chance or a natural variation or because of some other factors such as better nutrition playing a part, has to be decided.
  3. To find the correlation between two variables X and Y such as height & weight whether weight increases or decreases proportionately with height & if so by how much has to be found.

2. In Pharmacology

  1. To find the action of drug. For examples,a drug given to animals or humans to see whether the changes produced are due to drug or by chance
  2. To compare the action of two different drugs or two successive dosages of the same drug.
  3. To find the relative potency of a new drug with respect to a standard drug.

3. In Medicine

  1. To compare the efficacy of a particular drug, operation or line of treatment . For example, the percentage cured,relieved or died in the experiment & control groups is compared & difference due to chance or otherwise is found by applying statistical technique.
  2. To find an association between two attributes such as cancer & smoking -an appropriate test is applied for this purpose.
  3. To identify sign & symptoms of a disease or syndrome . Cough in typhoid is found by chance and fever is found in almost every case . The proportional incidence of one symptom or another indicates whether it is a characteristic feature of the disease or not.

4. In Community Medicine & Public Health

  1. To test usefulness of vaccines in the field – percentage of attacks or deaths among the vaccinated subjects is compared with that among the non-vaccinated ones to find whether the difference observed as statistically significant .
  2. In epidemiological studies – the role of causative factors is statistically tested . Deficiency of iodine as an important cause of goitre in a community is confirmed only after comparing the incidence of goitre cases before and after giving iodised salt.

1. Mahajan BK 2002 (Methods in Biostatistics) (6th edition)
2. Zaman SM, HK Rahim and M Howlader 1982. (Simple Lessons from Biometry), BRRI
3. Class Note.


In statistical language any character, characteristic or quality that varies is called variable.

A characteristic that takes on different values in different persons, places or things such as height, weight, blood pressure, age etc is variable. It is denoted usually as “x”. Variables can be of two types: Categorical & Numerical variable.


  • Variable is usually represented by x. Such as x1 x2 x3 ………  xn , where number of variable is n and individual variable is x.
  • Variation is created due to genetic recombination.
  • Variation can be caused by both artificial and natural mutation.

Random variable

Random variable is a variable whose value is a numerical outcome of a random phenomenon. For example: Flip three coins and let x represent the number of heads. Here,  x is a random variable.

Random variable is not a probability. Its value doesn’t need to be positive or between 0 and 1 as in the case of probability.

Numerical variables is divided into two categories:

  • Qualitative or Discrete Data or, Variable
  • Quantitative or Continuous Data or, Variable

Qualitative or Discrete Data or, Variable

Qualitative Data are classified by counting the individuals or things having the same characteristic or attribute; and not by measurement. Examples:

  • The number of cars in a parking lot,
  • Number of quarters in a purse, jar, or bank,
  • Ages on birthday cards (always in discrete number like 21 years old) etc.

Individuals with the same characteristic are counted to form specific groups or classes. 

Qualitative data are discrete in nature, such as, number of deaths in different years, population of different towns, persons with different blood groups in a population.


  • Discrete variables have no continuity. So, they are also called qualitative variables.

Quantitative or Continuous Data or, Variable

A continuous variable is a variable that has an infinite number of possible values. In other words, any value is possible for the variable.

A continuous variable doesn’t have to have every possible number (like -infinity to +infinity), it can also be continuous between two numbers, like 1 and 2. For example, data of a discrete variable could be 1, 2 while the continuous variables could be 1, 2 and also everything or anything in between: 1.00, 1.01, 1.001, 1.0001…


  • Time it takes a computer to complete a task,
  •  A person’s weight,
  • Age etc.

The weight of students from 2nd year are (in kg) 40.9, 45, 55, 50.1, 53, 54, 54, 48, 48.5, 46, 70, 85, 82, 83.1, 62.5 etc.(See how the number varies within a range)

In case of quantitative/continuous data there are two variables- the characteristics such as height & the frequency. We find the characteristic as well as the frequency both vary from person to person as well as from group to group.

The quantitative data obtained from characteristic variable (e.g. height of individuals in 2nd year) are called continuous data because each individual has one measurement from a continuous spectrum or range.

Some of the statistical methods employed in analysis of quantitative data are mean, range, standard deviation, coefficient of variation etc.


Mahajan BK 2002. (Methods in Biostatistics) (6th Edition)

Print Friendly, PDF & Email
3.3 4 votes
Article Rating

About Tarannum Ahsan

Tarannum Ahsan
I'm a student of department of botany at University of Dhaka. I'm learning a lot of new interesting things about different spheres of botany and I'll keep updating about them to keep your knowledge of nature enriched.

Check Also

Linkage and Recombination (Part 2): Sex linkage in Drosophila, Linked gene & Synthetic gene

After Linkage and recombination part 1 We are going to learn from this part- Sex …

Notify of
Inline Feedbacks
View all comments
Would love your thoughts, please comment.x