what is statistics and its applications ca cpt foundation

Under this chapter we learn following topics

Statistics and its applications

Data collection technique

Distinction between primary and       secondary data

Presentation of data in textual and tabular

Cumulative frequency and frequency distribution

Graph using histogram, frequency polygon and pi chart

Use of statistic :

Statistic is used in all subjects like management, commerce, mathematics, social sciences etc.

It also used in public service like defence, banking, insurance, tourism, police, military, hospitality etc.

It collects the data, analyses it and apply in statistic inferences

 

It has universal application

 

Government, businessman, political party and research scholar use the statistics

Hence it is required to study

History

There are various views for its origin

 

This word was used in ancient and medieval period.

 

Its origin may be from any of the following term.

Latin- status, Italian- statista, German- statistik, French- statistique

 

Then it is known as a state because it collects and maintains data of the people belonging to the state.

In the 4th century B.C. , Kautilya had written Arthashastra during Chandragupta’s reign.

In the 16 century A.D. , Abu Fazl had written Ain-i- Akbari on agriculture

Definition

It can be used in singular or plural sense

 

In plural it is qualitative and quantitative

 

In singular it collects, analyses and present the data hence it is known as science of counting on science of a average

Applications of the statistics

Economics

Roots of economics are in statistics

Certain chapters like time series analysis, index number, demand analysis are included in both.

There is a branch of economics known as a econometrics which interact with statistics

Socio economic survey are based on statistical methods

In economics for the future projection of the demand, sale price, quantity regression analysis is used

Business management

In earlier days, decision was taken by managers on the basis of trial and error.

Now a days due to complexity in the business decisions are taken on the basis of quantitative techniques which is based on statistical method and operation research.

Statistics provides inferences about universe on the basis of sample.

 

Statistics in commerce and industry

Due to globalisation businesses are expanded without any limit.

Hence it needs various accounting data and statistical applications.

Limitations of statistics

It is based on aggregate hence individual item has no importance except it is a part of aggregate

It is concerned with quantitative data hence in case of qualitative data it is difficult to convert in a numerical.

Future projections are based on set of conditions.

It is based on sample samples are unrepresentative result may not be correct four population.

 

Collection of data

There are two types of data 1 quantitative 2 qualitative

Quantitative data is known as a variable

There are two types of variable 1 discrete 2 continuous

 

Discrete means isolated single value example road accidents, misprints in a book, petals in a flower

Continuous means any value from an interval example height, weight, sale, profit,

Variable can be finite or infinite.

Qualitative characteristic is known is an attribute example nationality gender colour etc.

Classification of the data

Primary – first time collected

Secondary – collected data used by another person

Methods of collection of primary data

Interview method – direct (personal) interview – investigator meets directly used in natural calamity

It is accurate method

Indirect interview – when there is a problem to reach there, information is collected from the associated person.

Telephone interview – quick, non expensive and covers large area but non response is maximum

Mailed questionnaire – well drafted and soundly sequenced questionnaire – all important aspects- with guideline- pre paid stamp. It covers large area but maximum non response.

In questionnaire method enumerator directly collect information with questionnaire, Hence non response can be minimised

Observation method – best method used- in height, weight. It is time consuming laborious Hence covers small area only

Scrutiny of data

Data should be accurate and consistence

Errors may arise in writing or copying.

Internal consistency may be checked from two or more series.

Example density = area/ population

Information submitted by enumerator can be verified and if bias is reflected errors can be rectified Or data can be collected again.

Presentation of data

After Collection of data, it should be verified for its homoginity, Compatibility, Association and consistency then it is required to present in neat and condensed form and analyse it.

 

Types of classification of data

Chronological, temporal or time series – year wise CA details

Geographical or spatial series data – State wise unemployment

Qualitative or ordinal data : intelligence, beauty, nationality

Quantitative or ordinal data : height weight profit etc.

Qualitative and quantitative are frequency group data

Time series data and geographical data are non frequency group

Mode of presentation of data

Textual presentation – paragraphs- official reports

It is simple and layman can use

But it is also dull, monotonous and comparison is difficult

 

Tabular presentation

It is a systematic presentation with raw, column, footnote, title and reference number

Caption is upper part of table

Box head is the entire upper part off the table

Stub is the left part of the table describing the raws   `

Body is the main part of the table

It is used for comparison

Without table statistical analysis is not possible

Diagrammatic representation of the data

It can use chart diagrams and program

It is used for both educated and uneducated

Trend can be noticed

Types of diagram

Line diagram or histogram

Vary over time – year wise profit

In case of fluctuation logarithm ratio chart is used

When two or more time series data in a same unit multiple line chart is used and in a different units multiple axis chart is used

Bar diagram

In horizontal bar diagram qualitative data varying over space while in vertical bar diagram quantitative data are used in multiple or g Rectangles are equal or varying length.

Bar diagram can be subdivided or it can be presented in percentagerouped bar diagram.

 

Pie chart pie diagram or circle diagram is used for comparison

Frequency distribution

It is a tabular presentation of data in ascending order

Data can be variable or attribute

It can be divided in number of categories or classes which are mutually exclusive and exhaustive

Number of times the class occurs will be Shown as a corresponding to a particular class

It is known as a frequency of that class

It distributes total frequency into number of classes

When data is discrete tabulation is known as discrete ungrouped or frequency distribution

When data is continuous tabulation is known as continuous frequency distribution

How to form frequency distribution table

Find the range. Range = largest minus smallest

Find the number of classes. Number of classes= range/ class length

Apply tally mark count the tally Mark show in the next column which is known as frequency column

Total frequency should be matched with total number of observation

Class limit- in a class interval minimum value is known as lower class limit and maximum value as a upper class limit

Class boundaries

In overlapping or exclusive classification class boundary and class limit both are equal. Used in continuous variable 10-20

In non overlapping all mutually inclusive. 10-19 LCB=LCL-D/2 AND UCB=UCL+D/2

Midpoint or mid value or class mark= LCL+ UCL/2 OR LCB+UCB/2

Width OR size of the class interval =- UCB- LCB OR UCL-Lcl

Cumulative frequency it can be less then or more than cumulative frequency. number of observations less than or equal to value or class boundary – less than cumulative frequency

Number of observations More than or equal To value – More than cumulative frequency

Frequency density frequency of that class to corresponding class length

Relative frequency or the percentage frequency of a class interval ratio of class frequency to total frequency if it is in percentage multiply with hundread

Graphical representation of frequency distribution

Histogram or area diagram

Class limits are converted to the corresponding class boundaries and a series of adjacent rectangles in class intervals

When class intervals are not uniform it is erected. Mode can be calculated easily.

 

Frequency polygon

it is known as single frequency distribution

It can be used in grouped frequency also when width of the class interval remains the same midpoint of the interval are connected.

It starts with the histogram. Mid points of upper sides of rectangles are connected.

 

 

Ogives or cumulative frequency graph :

Plot cumulative frequency against class boundary

There are two type of ogive less than or more than. Both should be presented in vertical axis.

Quartiles can be calculated perpendicular from the point of intersection of two ogive on horizontal axis. it is known as median it can be used for short term projection.

Frequency curve : it is a smooth and free hand curve total area is an Unity it is a limiting form of histogram for the frequency polygon

There are types of frequency curve

Bell shaped curve – starting from a lower value gradually reaches at maximum value at Central Part then decrease gradually at its lowest value

 

U shaped curve minimum at Central Part and maximum the two extremes Kolkata commuters at morning to evening

J shaped curve starts with the minimum frequency than increased to maximum frequency Kolkata commuters early morning to pick morning.

 

Combination of the frequency curve known as a mixed curve

Measures of Central tendency and dispersion

learning objectives

Central tendency – Mean, Median, Mode, Geometric mean and harmonic mean. – advantages and disadvantages – when these are used.

dispersion means volatility. range, mean deviation, standard deviation and quartile deviation.

 

Central tendency

definition

at starting low frequency newly increased to the maximum