8.2.15
(2027 Exams) Reliability (A2 only)
Measuring Reliability: Internal and External
Reliability is how consistent a test or study is. Reliability can be measured in several ways: test-retest, the split-half method and correlating inter-observer reliability.

Reliability
- Reliability is how consistent a test or study is.
- In other words, does the test, when repeated in identical conditions or with similar participants, come up with the same results?

Test-retest
- Test-retest is when the same test or study is repeated with the same participants on a different occasion.
- If the results are similar both times, the test is said to have external reliability.
- External reliability means a test or study consistently produces the same results whenever it is carried out.

Examples of external reliability
- A simple example would be if a doctor gave two pregnancy tests to a woman and the results came back positive both times. This suggests the test is reliable.
- Another example would be a maths test. If a student took a maths test twice and got similar results, the test is said to have external reliability.
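
The maths test example can be sketched as a correlation between the two sittings. This is only an illustration: the scores are made up, the helper name `pearson_r` is hypothetical, and the cut-off of 0.8 is an assumption (a commonly quoted rule of thumb, not a figure given in these notes).

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("scores must vary for a correlation to be defined")
    return cov / sqrt(var_x * var_y)

# Made-up maths test scores for five students who sat the same test twice.
first_sitting = [62, 75, 48, 90, 55]
second_sitting = [65, 73, 50, 88, 58]

ASSUMED_CUT_OFF = 0.8  # assumption: rule-of-thumb threshold, not from the notes

test_retest_r = pearson_r(first_sitting, second_sitting)
is_externally_reliable = test_retest_r >= ASSUMED_CUT_OFF
print(f"test-retest correlation: {test_retest_r:.2f}")
print(f"meets assumed cut-off: {is_externally_reliable}")
```

A correlation close to +1 means students who scored highly the first time also scored highly the second time, which is what test-retest reliability looks for.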

Split-half method
- The split-half is used to measure the internal reliability of a test.
- The split-half method takes one test and divides it into two sections.
- This can be even/odd numbered questions or first half/second half.

Internal reliability
- If there is a strong positive correlation between both halves, the test is said to have internal reliability.
- In other words, each participant gets similar scores on both halves of the test.
- Internal reliability looks within a single test: do all parts of the test produce consistent results?
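
The split-half method can be sketched in a few lines, using the odd/even split described above. This is a rough illustration rather than a standard procedure: the response data are invented, and the helper names `pearson_r` and `split_half_scores` are hypothetical.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def split_half_scores(item_scores):
    """Return (odd-numbered total, even-numbered total) for one participant."""
    odd_total = sum(item_scores[0::2])   # questions 1, 3, 5, ...
    even_total = sum(item_scores[1::2])  # questions 2, 4, 6, ...
    return odd_total, even_total

# Invented data: each row is one participant's marks (1 = correct, 0 = wrong)
# on an eight-question test.
responses = [
    [1, 1, 1, 1, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 0, 1, 1, 0, 1, 0],
]

halves = [split_half_scores(row) for row in responses]
odd_totals = [odd for odd, _ in halves]
even_totals = [even for _, even in halves]

split_half_r = pearson_r(odd_totals, even_totals)
print(f"split-half correlation: {split_half_r:.2f}")
```

Splitting by odd and even questions, rather than first half and second half, spreads any effect of tiredness or increasing difficulty across both halves.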
Measuring Reliability: Inter-Observer Reliability

Inter-observer reliability
- In a highly idealised study, the exact same researcher would administer the test to all participants.
- This would avoid inconsistent results caused by the extraneous variable of having different researchers.
- In practice this is often not possible, so researchers follow a specific, standardised procedure to keep their measurements consistent.

Example - aggressive five-year-olds
- For example, if two separate researchers are observing aggressive behaviour in five-year-olds, they should give the same score (1-10) for the same level of aggression.
- A five-year-old who hits another child would receive a score of 10.
- If both researchers observed the behaviour and gave the same score of 10, then the observation is said to have inter-observer reliability.
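
One way to check this is to compare the two researchers' scores for the same set of children. The sketch below uses invented aggression ratings and reports both the proportion of identical scores and the correlation between observers. The helper names are hypothetical, and which measure a real study would report is not stated in these notes.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def exact_agreement(ratings_a, ratings_b):
    """Proportion of observations where both observers gave the same score."""
    matches = sum(1 for a, b in zip(ratings_a, ratings_b) if a == b)
    return matches / len(ratings_a)

# Invented aggression scores (1-10) given by two observers to the same six children.
observer_a = [10, 3, 7, 1, 5, 8]
observer_b = [10, 4, 7, 2, 5, 9]

agreement = exact_agreement(observer_a, observer_b)
observer_r = pearson_r(observer_a, observer_b)
print(f"identical scores: {agreement:.0%}")
print(f"correlation between observers: {observer_r:.2f}")
```

In this made-up data the observers give identical scores only half the time, yet their ratings rise and fall together, so the correlation is still high. This is why the two measures can tell different stories.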
Improving Reliability and Validity
Reliability and validity are essential for making sure that psychological tests are accurate, fair and meaningful. Standardising research and operationalising variables improve both validity and reliability.

Standardising research
- Standardising research is crucial for having reliable and valid results.
- Scientific studies follow strict guidelines and specific procedures.
- An example of this would be a study on short-term memory recall. The aim of this study is to look at the effect of age on short-term memory.

Conditions and procedure
- Psychologists would decide on the conditions of the study.
- For example: a well-lit room, a 9:00 AM start, participants of the same race and gender, and one researcher in the room.
- The study would follow a set procedure of handing out papers, giving instructions from a set script, and setting a certain time limit on memory questions.

Extraneous variables
- The procedures must be as specific as possible to make sure that there are as few extraneous variables as possible.
- Extraneous variables are any variables other than the independent variable that could affect the results.
- These set procedures improve external reliability and inter-observer reliability.

Operationalising variables
- When setting up a study, psychologists determine their aim and from there, establish their independent and dependent variables.
- They will also try to minimise extraneous variables which could affect their results.
- The variables must be clearly defined in measurable terms. This is called operationalising the variables.

Guidelines
- In the Kanner et al. (1981) study examining how daily hassles link to stress and health, the researchers had to clearly define stress, daily hassles and health.
- The participants had to rate their hassles, which made the measure clearer.
- If participants had simply listed hassles with no guidelines or descriptions, the results could have been less valid.
- Making the variables crystal clear improves the validity and reliability of the test.
