Data integrity — validation and verification

Vocabulary

English	Chinese	Pinyin
integrity	完整性	wán zhěng xìng
validation	验证	yàn zhèng
verification	核对	hé duì
range check	范围检查	fàn wéi jiǎn chá
check digit	校验位	jiào yàn wèi
double entry	双重录入	shuāng chóng lù rù
parity check	奇偶校验	jī ǒu jiào yàn
checksum	校验和	jiào yàn hé

Keeping data accurate

Integrity 完整性 means data is accurate and complete.
Two techniques guard it: validation 验证 (before storing) and verification 核对 (when entering or transferring).
They answer different questions — you need both.

Explore

Computing concept lab

Classify concrete examples by the computing idea they demonstrate.

Validation — does the data make sense?

Validation checks data against sensible rules, automatically:
range check 范围检查 (a month is 1–12), length check (right number of characters),
type/character check (digits only), format check (an email contains @),
presence check (required fields not empty), and a check digit 校验位 (an extra digit computed from the rest, as on ISBNs and card numbers, that spots transcription errors).

Practice

Checking that a month entered is between 1 and 12 is a:

range check
presence check
format check
check digit

Practice

Match each validation check to its purpose.

a required field is not left empty

data matches a pattern (e.g. an email has @)

an extra digit spots transcription errors

Presence check
Format check
Check digit

Practice

An extra digit computed from the others to catch a mistyped number (as on ISBNs and card numbers) is a ______ digit.

What validation can't do

Validation catches data that is wrongly formatted.
It cannot catch data that is the right format but factually wrong — typing "Bob" instead of "Bib" passes every check.
For that, you need verification (and human care).

Practice

Why can validation still let bad data through?

data can be the right format but factually wrong (e.g. "Bob" for "Bib")
validation always rejects correct data
validation only works on numbers
validation needs the internet

Verification — was it copied correctly?

Verification checks the data wasn't changed in moving from one place to another.
On entry: double entry 双重录入 (type it twice and compare, like a new password) or a visual check.
On transfer (bits can flip): a parity check 奇偶校验 (an extra bit makes the number of 1s even/odd — catches single-bit errors), a checksum 校验和 (a summary value recomputed and compared), or a stronger CRC.

The parity bit is set to make the number of 1s even or odd

A parity bit is added so the number of 1s is even (or odd) — a single flipped bit breaks the count.

Working out a checksum for a block of data

A checksum is a summary value computed from the data, then recomputed on arrival and compared.

Practice

A parity check works by:

adding a bit so the number of 1s is even (or odd), then re-counting at the receiver
typing the data twice
encrypting the data
compressing the data

Practice

Asking a user to type a new password twice is an example of:

double-entry verification
a range check
encryption
a checksum

Validation vs verification

Validation asks: "is this data sensible?"
Verification asks: "was this data copied/entered correctly?"
Verification proves what arrived matches what was sent — not that it is correct, and not against deliberate tampering.

Validation checking data against rules versus verification checking it was copied correctly

Validation asks if the data is sensible; verification asks if it was copied or entered correctly

Data validation checks data is sensible (range, length, type, format, presence, existence check, limit check); data verification checks it was entered/copied correctly.

Practice

Which statement is correct?

validation asks "is it sensible?"; verification asks "was it copied correctly?"
they are the same thing
verification checks data is factually true
validation only happens during transfer

Practice

Verification proves the data was copied or entered without change — but not that the value is factually correct.

You've got it

Key idea

validation = automatic sensible-rule checks (range, length, type, format, presence, check digit)
validation can't catch the right-format-but-wrong value
verification = was it copied correctly: double entry (input), parity/checksum/CRC (transfer)
validation = "sensible?"; verification = "copied correctly?" — use both

Data integrity — validation and verification

Keeping data accurate

Computing concept lab

Validation — does the data make sense?

What validation can't do

Verification — was it copied correctly?

Validation vs verification

You've got it

Handout

Log in or create account

Feedback & help