How to Do Data Entry for Questionnaires (Step-by-Step Guide)
Data entry is a critical stage in any research process, especially when working with paper-based questionnaires. Many surveys are still conducted offline, proper data entry ensures accuracy, reliability, and meaningful analysis.
Whether youโre a student, nurse researcher, or public health professional, this guide will walk you through the process step-by-step.
๐ What is Data Entry in Research?

Data entry is the process of converting responses from paper questionnaires into a digital format for analysis using tools like Microsoft Excel or SPSS.
๐งพ Step 1: Prepare Your Questionnaire for Data Entry
Before entering any data, you must code your questionnaire.
โ Assign Numerical Codes
Convert responses into numbers:
- Yes = 1, No = 0
- Male = 1, Female = 2
- Strongly Agree = 5 โ Strongly Disagree = 1

โ Create a Codebook
A codebook helps you stay organized and ensures consistency.
Example:
| Question | Variable Name | Codes |
|---|---|---|
| Gender | gender | 1=Male, 2=Female |
| Age | age | Actual value |
| Screening done? | screening | 1=Yes, 0=No |
๐ป Step 2: Choose a Data Entry Tool
Common tools used include:
- Microsoft Excel (Beginner-friendly)
- SPSS (For analysis)
- Epi Info (Free public health tool)
- Google Sheets (Online collaboration)

๐ Best Practice: Start with Excel, then export to SPSS for analysis.
๐ Step 3: Design Your Data Entry Template
Structure your spreadsheet properly:
- Rows = Respondents
- Columns = Variables (questions)
Example:
| ID | gender | age | knowledge_score | screening |
|---|---|---|---|---|
| 001 | 1 | 25 | 8 | 1 |
| 002 | 2 | 30 | 5 | 0 |
โ๏ธ Step 4: Enter Data Accurately
- Enter codes, not words
โ Male โ โ 1 - Work one questionnaire at a time
- Use a missing value code (e.g., 99)
- Avoid skipping any responses
๐ Step 5: Data Cleaning (Very Important!)
After data entry, check for:
- Missing values
- Incorrect codes
- Outliers (e.g., age = 200)
Tools to Use:
- Excel filters
- SPSS โFrequenciesโ
๐ Step 6: Optional โ Enter Data Directly in SPSS
If using SPSS:
- Go to Variable View
- Define:
- Variable name (e.g., gender)
- Type (Numeric)
- Value labels (1=Male, 2=Female)
- Switch to Data View and start entering data
READ: SPSS TUTORIAL PDF
๐ Sample Questionnaire (Paper Format)
Imagine you conducted a study on prostate cancer awareness among commercial motorcyclists.
Section A: Socio-Demographic Data
- Age: ______
- Gender:
- Male [ ]
- Female [ ]
- Education Level:
- No formal education [ ]
- Primary [ ]
- Secondary [ ]
- Tertiary [ ]
Section B: Knowledge
- Have you heard about prostate cancer?
- Yes [ ]
- No [ ]
- Prostate cancer can be prevented
- Yes [ ]
- No [ ]
- I donโt know [ ]
Section C: Practice
- Have you ever gone for prostate screening?
- Yes [ ]
- No [ ]
๐ข Step 1: Code the Questionnaire
Before data entry, convert responses into numbers.
| Question | Variable Name | Codes |
|---|---|---|
| Age | age | Actual value |
| Gender | gender | 1=Male, 2=Female |
| Education | education | 1=None, 2=Primary, 3=Secondary, 4=Tertiary |
| Heard of PCa | heard_pca | 1=Yes, 0=No |
| Preventable | preventable | 1=Yes, 0=No, 2=Donโt know |
| Screening | screening | 1=Yes, 0=No |
๐ Step 2: Create Data Entry Sheet in Excel
Open Microsoft Excel and structure it like this:
| ID | age | gender | education | heard_pca | preventable | screening |
|---|
โ๏ธ Step 3: Enter Data from a Filled Questionnair
Example of ONE Completed Questionnaire:
- Age: 45
- Gender: Male
- Education: Secondary
- Heard of prostate cancer: Yes
- Preventable: No
- Screening: No
Convert to Codes:
- age = 45
- gender = 1
- education = 3
- heard_pca = 1
- preventable = 0
- screening = 0
Enter into Excel:
| ID | age | gender | education | heard_pca | preventable | screening |
|---|---|---|---|---|---|---|
| 001 | 45 | 1 | 3 | 1 | 0 | 0 |
๐ Another Example (Second Respondent)
- Age: 30
- Gender: Male
- Education: Primary
- Heard of prostate cancer: No
- Preventable: Donโt know
- Screening: No
Codes:
| ID | age | gender | education | heard_pca | preventable | screening |
|---|---|---|---|---|---|---|
| 002 | 30 | 1 | 2 | 0 | 2 | 0 |
โ ๏ธ Important Rules During Entry
- Enter numbers only, not text
- One row = one respondent
- Do not skip any variable
- Use a missing value (e.g., 99) if unanswered
๐ Step 4: Data Cleaning
After entering all questionnaires:
- Check for errors:
- gender = 5 โ (invalid)
- age = 150 โ
- Use filters in Excel
- Or analyze using SPSS โ Frequencies
VIDEO TUTORIAL
๐ณ๐ฌ Practical Tips for Researchers
- Number your questionnaires (e.g., 001โ200)
- Use clear handwriting during data collection
- Apply double data entry for accuracy
- Always backup your data (flash drive + email/cloud)
โ ๏ธ Common Mistakes to Avoid
- Entering text instead of numeric codes
- Skipping the codebook
- Mixing up rows and columns
- Ignoring data cleaning
- Losing questionnaires before entry ๐
๐ก Final Thoughts
Accurate data entry is the foundation of quality research. No matter how good your analysis is, poor data entry will lead to unreliable results.
For students and researchers, mastering this process will significantly improve your project quality, especially when working with tools like SPSS for advanced analysis such as regression.
๐ข Need Help?
If you need:
- A ready-made Excel template
- SPSS setup for your research
- Data cleaning or analysis support
Feel free to reach out to Nursing Research and Data Institute!