## “ We are just statistics, born to consume resources” - Horace (65-8 BC Epistles Book 1)

When information for a statistical investigation is collected and recorded this is referred to as data.
What is a statistical investigation?
There are four processes involved in a statistical investigation
1 Collection of data (information)

Data for a statistical investigation can be collected from records, from surveys, by direct observation or by measuring or counting. Data can be collected for the whole population, which generally means all the people of things that the conclusions of a statistical investigation would apply to, or for a sample of the population.
Collection of data is the first and most important task in an investigation because unless the correct data is collected valid conclusions cannot be made.
###### Discrete data can be displayed by:
· Pictogram - display of information using pictures to represent frequency.
· Bar chart – display of information using bars of different lengths to represent the frequency.
This is useful for making comparison over a period of time e.g. exam results
· Pie chart – uses a circle to represent data: useful to show proportions
· Leaf-and-stem diagram – shows all the original data and therefore useful to show trend of data
· Dot diagram – each dot corresponds to one score. The frequency of a number is the number of times it occurs.
· Line graphs - Shows trend over time example the number of tourists arrival in Changi Airport over a period of 5 years.
A histogram may be used to display non-discrete (continuous) data
3 Calculation of Descriptive Statistics

There are some statistics that are universally used to describe a set of data.
Calculating statistics that indicate the centre of the data and the spread of the data give us a picture of the sample or population under investigation.
###### This process involves explaining the meaning of the table, graph or descriptive statistics in terms of the variable or theory to be investigated.Some questions that you may ask to interpret statistical data:
1. What is the title of the chart, table or graph?
2. What is the purpose of this chart, table or graph?
3. What do the labels/headings tell you?
4. What key information does the data provide?
6. How were you able to interpret this chart, table or graph?
7. Who created the chart, table or graph?
8. What information does the chart, table or graph add to the given account(s) of event?

Extension:

Types of data
Data are individual observations of a variable. A variable is a quantity that can have a value recorded for it or to which we can assign an attribute or quality.
There are two types of variable that we commonly deal with :
Categorical variables
A quality is recorded for this type of variable.
The information collected is called categorical data.
Examples of categorical variables are:
• Colour of eyes : the categories would be blue, brown, hazel, green, violet
• Gender : Male or Female
• Marital Status : Single, Married, Separated, Divorced
Numerical variables
A numerical value is recorded for this type of variable. The information collected is called numerical data.
Examples of numerical variables are :
• The number of people in a household : the variable could take the values 1, 2, 3, 4, …
• The weight of new born babies : the variable could take any value on the number line but it is likely to be in the range of 0.5 kg to 8 kg.
• The score out of 30 on a test : the variable could take the values 0, 1, 2, …, 30