Concept of Data Collection
Concept of Data Collection
Data collection is the process of gathering and measuring information on variables of interest, in
an established systematic fashion that enables one to answer stated research questions, test
hypotheses, and evaluate outcomes.
The goal for all data collection is to capture quality evidence that then translates to rich data
analysis and allows the building of a convincing and credible answer to questions that have been
posed.
TYPES OF DATA
Data are organized into two broad categories: qualitative and quantitative.
Qualitative Data: Qualitative data are mostly non-numerical and usually descriptive or
nominal in nature. This means the data collected are in the form of words and sentences.
Often (not always), such data captures feelings, emotions, or subjective perceptions of
something. Qualitative approaches aim to address the ‘how’ and ‘why’ of a program and
tend to use unstructured methods of data collection to fully explore the topic. Qualitative
questions are open-ended. Qualitative methods include focus groups, group discussions
and interviews.
Questionnaire:
It is the most commonly used method in survey. Questionnaires are a list of questions either
open-ended or close-ended for which the respondents give answers. Questionnaire can be
conducted via telephone, mail, live in a public area, or in an institute, through electronic mail
or through fax and other methods.
Interview:
Interview is a face-to-face conversation with the respondent. In interview, the main
problem arises when the respondent deliberately hides information otherwise it is an in
depth source of information. The interviewer can not only record the statements the interviewee
speaks but he can observe the body language, expressions and other reactions to the
questions too.
Observations:
Observation can be done while letting the observing person know that s/he is being
Observed or without letting him know. Observations can also be made in natural settings as well
as in artificially created environment.
Advantages of Primary Data
More Accurate: Primary data is researched and developed by individuals giving
personal attention to each of the questions they wanted to survey. Assessments could be
made by individuals working on it. Therefore making data more valid and accurate.
Updated information: The data which is produced is first-hand data. No other person’s
data has been used to develop information that makes the data updated and people rely on
information that is more updated and recently been produced.
More Control over data: Since the person who is surveying through questionnaires or
any other design can manage the data, thus he has control over the whole survey. He can
manipulate or make certain updates in the questionnaire to make it more meaningful.
Privacy is maintained: The expert who is maintaining all the survey work also
maintains the secrecy of the data. It is pretty obvious that when few individuals maintain
the whole survey work then mostly secrecy is maintained among them without leaking
the ideas
The targeted problem is dealt with: People who are engaged in the collection of data
prepare the questionnaire and sometimes take the interview from the targeted group to
obtain data. Also, the problem is addressed so that after proper feedback it could be put in
the limelight and can be resolved.
Understanding of data is better: The data surveyed through different designs and
methods are made simple and easy to understand so that the person who is interested in
data can easily understand it with a glance.
Time-consuming process: It may take a lot of time to collect the data from raw sources.
Costly: The survey done is time-consuming as well as costly. For the collection of data,
experts have to visit and use equipment that may cost very high.
Require more labour: The manpower requirement is more in the case of primary data as
only one person cannot make a survey on his own or collect data single-handedly.
The questionnaire must be easy and understandable: The questionnaire prepared must
be easy to understand then only the researchers may get correct and valid feedback. The
researchers have to make the set of sample questionnaires in such a way or use the
method or technique that may help the people to interpret it easily if not the feedback
which is produced will be wrong or inaccurate.
Feedback may be faulty: If the technique used by researchers will not be proper or
accurate then the feedback also which is produced will be faulty or incorrect. Thus proper
action and maintenance must be done so that the feedback data which is produced should
be authentic.
An experienced person is needed for the analysis: The method for collecting data,
technique, and analysis must be done by an expert as only an expert can make the whole
research meaningful by providing genuine facts and information.
Secondary Data
Data collected from a source that has already been published in any form is called as
secondary data. The review of literature in any research is based on secondary data. It is
collected by someone else for some other purpose (but being utilized by the investigator
for another purpose). For examples, Census data being used to analyze the impact of education
on career choice and earning.
Sources of Secondary Data:
The following are some ways of collecting secondary data;
Books, Records, Biographies, Newspapers, Published censuses or other statistical data, Data
archives, Internet articles, Research articles by other researchers (journals), Databases, etc.
Importance of Secondary Data:
Secondary data can be less valid but its importance is still there.
Sometimes it is difficult to obtain primary data; in these cases getting information from
secondary sources is easier and possible.
Advantages of Secondary Data:
Ease of access: The secondary data sources are very easy to access. The Internet has
changed the way secondary research works. Nowadays, you have so much information
available just by clicking with the mouse.
Low cost or free: The majority of secondary sources are absolutely free for use or at
very low costs. It saves not only your money but your efforts.
Time-saving
Secondary data collection can be done in no time. Sometimes it is a matter of a few
Google searches or records to find a source of data.
Allow you to generate new insights from previous analysis: Reanalyzing old data can
bring unexpected new understandings and points of view or even new relevant
conclusions.
Longitudinal analysis: Secondary data allows you to perform a longitudinal analysis
which means the studies are performed spanning over a large period of time. This can
help you to determine different trends. In addition, you can find secondary data from
many years back up to a couple of hours ago. It allows you to compare data over time.
Anyone can collect the data: Secondary data research can be performed by people that
aren’t familiar with the different data collection methods. Practically, anyone can collect
it.
A huge amount of secondary data with a wide variety of sources: It is the richest type
of data available in a wide variety of sources and topics.
Disadvantages:
Might not be specific to your needs: Secondary data is not specific to the researcher’s
needs due to the fact that it was collected in the past for another reason. Secondary data
sources can give you a huge amount of information, but quantity does not always mean
appropriateness.
No control over data quality: The secondary data might lack quality. The source of the
information may be questionable, especially when you gather the data via the Internet.
Biasness: As the secondary data is collected by someone else than you, typically the data
is biased in favour of the person who gathered it.
Not timely: Secondary data is collected in the past which means it might be out-of-date.
This issue can be crucial in many different situations.