2

Bridging null hypothesis testing and estimation: A practical guide to statistical conclusion drawing from research in psychology featured image

Bridging null hypothesis testing and estimation: A practical guide to statistical conclusion drawing from research in psychology

A well-known problem of null hypothesis significance testing is that it cannot be used to find support for the null hypothesis. A common solution for this is to replace the exact …

henk-a.-l.-kiers

Introduction to the Bayes factor: A Shiny/R app

jorge-n.-tendeiro
Model-data fit evaluation: Aberrant response detection featured image

Model-data fit evaluation: Aberrant response detection

Person-fit analysis is an important field aimed at establishing the validity (or lack thereof) of the response pattern provided by each respondent of a test or questionnaire. This …

jorge-n.-tendeiro
Does Functional Somatic Symptoms measurement differ across Sex and Age? Differential Item Functioning in Somatic Symptoms measured with the CIDI featured image

Does Functional Somatic Symptoms measurement differ across Sex and Age? Differential Item Functioning in Somatic Symptoms measured with the CIDI

Functional Somatic Symptoms (FSS) are physical symptoms that cannot be attributed to underlying pathology. Their severity is often measured with sum-scores on questionnaires; …

angelica-acevedo-mesa
On the potential mismatch between the function of the Bayes factor and researchers’ expectations featured image

On the potential mismatch between the function of the Bayes factor and researchers’ expectations

The aim of this study is to investigate whether there is a potential mismatch between the usability of a statistical tool and psychology researchers' expectation of it. Bayesian …

tsz-keung-wong
Mixed-Effects Trait-State-Occasion Model: Studying the Psychometric Properties and the Person-Situation Interactions of Psychological Dynamics featured image

Mixed-Effects Trait-State-Occasion Model: Studying the Psychometric Properties and the Person-Situation Interactions of Psychological Dynamics

The trait-state-occasion model (TSO) is a popular model within the latent state-trait theory (LST). The TSO allows distinguishing the trait and the state components of the …

sebastian-castro-alvarez
On the white, the black, and the many shades of gray in between: Our reply to van Ravenzwaaij and Wagenmakers (2021) featured image

On the white, the black, and the many shades of gray in between: Our reply to van Ravenzwaaij and Wagenmakers (2021)

In 2019 we wrote a paper (Tendeiro & Kiers, 2019) in Psychological Methods over null hypothesis Bayesian testing and its working horse, the Bayes factor. Recently, van Ravenzwaaij …

jorge-n.-tendeiro
Using Structural Equation Modeling to Study Traits and States in Intensive Longitudinal Data featured image

Using Structural Equation Modeling to Study Traits and States in Intensive Longitudinal Data

Traditionally, researchers have used time series and multilevel models to analyze intensive longitudinal data. However, these models do not directly address traits and states …

sebastian-castro-alvarez
Education Increases Decision-rule Use: An Investigation of Education and Incentives to Improve Decision Making featured image

Education Increases Decision-rule Use: An Investigation of Education and Incentives to Improve Decision Making

Robust scientific evidence shows that human performance predictions are more valid when information is combined mechanically (with a decision rule) rather than holistically (in …

marvin-neumann
Worked-out examples of the adequacy of Bayesian optional stopping featured image

Worked-out examples of the adequacy of Bayesian optional stopping

The practice of sequentially testing a null hypothesis as data are collected until the null hypothesis is rejected is known as optional stopping. It is well-known that optional …

jorge-n.-tendeiro
The Crit Coefficient in Mokken Scale Analysis: A Simulation Study and an Application in Quality-of-Life Research featured image

The Crit Coefficient in Mokken Scale Analysis: A Simulation Study and an Application in Quality-of-Life Research

Purpose: In Mokken scaling, the Crit index was proposed and is sometimes used as evidence (or lack thereof) of violations of some common model assumptions. The main goal of our …

daniela-r.-crisan
Seven steps toward more transparency in statistical practice featured image

Seven steps toward more transparency in statistical practice

We argue that statistical practice in the social and behavioural sciences benefits from transparency, a fair acknowledgement of uncertainty and openness to alternative …

eric-jan-wagenmakers
Improving the measurement of functional somatic symptoms with Item Response Theory featured image

Improving the measurement of functional somatic symptoms with Item Response Theory

More than 40 questionnaires have been developed to assess functional somatic symptoms (FSS), but there are several methodological issues regarding the measurement of FSS. We aimed …

angelica-acevedo-mesa
On the practical consequences of misfit in Mokken scaling featured image

On the practical consequences of misfit in Mokken scaling

Mokken scale analysis is a popular method to evaluate the psychometric quality of clinical and personality questionnaires and their individual items. Although many empirical …

daniela-r.-crisan
A review of issues about null hypothesis Bayesian testing featured image

A review of issues about null hypothesis Bayesian testing

Null hypothesis significance testing (NHST) has been under scrutiny for decades. The literature shows overwhelming evidence of a large range of problems affecting NHST. One of the …

jorge-n.-tendeiro
Guilt in bereavement: Its relationship with complicated grief and depression featured image

Guilt in bereavement: Its relationship with complicated grief and depression

This study investigated the relationship between guilt and well-being of bereaved persons, and explored potential differences in the associations between guilt-complicated grief …

jie-li
Practical consequences of model misfit when using rating scales to assess the severity of attention problems in children featured image

Practical consequences of model misfit when using rating scales to assess the severity of attention problems in children

In this study, we examined the consequences of ignoring violations of assumptions underlying the use of sum scores in assessing attention problems (AP) and if psychometrically …

daniela-r.-crisan
Gender-based differential prediction by curriculum samples for college admissions featured image

Gender-based differential prediction by curriculum samples for college admissions

A longstanding concern about admissions to higher education is the underprediction of female academic performance by admission test scores. One explanation for these findings is …

a.-susan-m.-niessen
Bayes factors for superiority, non-inferiority, and equivalence designs featured image

Bayes factors for superiority, non-inferiority, and equivalence designs

In clinical trials, study designs may focus on assessment of superiority, equivalence, or non-inferiority, of a new medicine or treatment as compared to a control. Typically, …

don-van-ravenzwaaij
GGUM: An R package for fitting the generalized graded unfolding model featured image

GGUM: An R package for fitting the generalized graded unfolding model

In this article, the newly created GGUM R package is presented. This package finally brings the generalized graded unfolding model (GGUM) to the front stage for practitioners and …

jorge-n.-tendeiro
What are the minimal sample size requirements for Mokken scaling? An empirical example with the Warwick-Edinburgh Mental Well-Being Scale featured image

What are the minimal sample size requirements for Mokken scaling? An empirical example with the Warwick-Edinburgh Mental Well-Being Scale

Sample size in Mokken scales is mostly studied on simulated data, reflected in the lack of consideration of sample size in most Mokken scaling studies. Recently, [Straat, J. H., …

roger-watson
Admission testing for higher education: A multi-cohort study on the validity of high-fidelity curriculum-sampling tests featured image

Admission testing for higher education: A multi-cohort study on the validity of high-fidelity curriculum-sampling tests

We investigated the validity of curriculum-sampling tests for admission to higher education in two studies. Curriculum-sampling tests mimic representative parts of an academic …

a.-susan-m.-niessen
Corrigendum: The use of subscores in higher education: When is this useful? featured image

Corrigendum: The use of subscores in higher education: When is this useful?

A corrigendum on *The Use of Subscores in Higher Education: When Is This Useful?*, by Meijer, R. R., Boevé, A. J., Tendeiro, J. N., Bosker, R. J., and Albers, C. J. (2017). Front. …

rob-r.-meijer
Identifying levels of general distress in first line mental health services: can GP- and eHealth clients' scores be meaningfully compared? featured image

Identifying levels of general distress in first line mental health services: can GP- and eHealth clients' scores be meaningfully compared?

The Four-Dimensional Symptom Questionnaire (4DSQ) (Huisarts Wetenschap 39: 538–47, 1996) is a self-report questionnaire developed in the Netherlands to distinguish non-specific …

jan-van-bebber
Investigating the practical consequences of model misfit in unidimensional IRT models featured image

Investigating the practical consequences of model misfit in unidimensional IRT models

In this article, the *practical* consequences of violations of unidimensionality on selection decisions in the framework of unidimensional item response theory (IRT) models are …

daniela-r.-crisan
The use of subscores in higher education: When is this useful? featured image

The use of subscores in higher education: When is this useful?

Assessment in higher education is challenging because teachers face more students, with less contact time as compared to primary and secondary education. Therefore, teachers and …

rob-r.-meijer
Applying organizational justice theory to admission into higher education: Admission from a student perspective featured image

Applying organizational justice theory to admission into higher education: Admission from a student perspective

Applicant perceptions of methods used in admission procedures to higher education were investigated using organizational justice theory. Applicants to a psychology study program …

a.-susan-m.-niessen
Measuring non-cognitive predictors in high-stakes contexts: The effect of self-presentation on self-report instruments used in admission to higher education featured image

Measuring non-cognitive predictors in high-stakes contexts: The effect of self-presentation on self-report instruments used in admission to higher education

Non-cognitive constructs such as personality traits and behavioral tendencies show predictive validity for academic performance and incremental validity over and above cognitive …

a.-susan-m.-niessen
The $l^*_{z(p)}$ person-fit statistic in an unfolding model context featured image

The $l^*_{z(p)}$ person-fit statistic in an unfolding model context

Although person-fit analysis has a long-standing tradition within item response theory, it has been applied in combination with dominance response models almost exclusively. In …

jorge-n.-tendeiro
Implicit and explicit self-esteem in current, remitted, recovered, and comorbid depression and anxiety disorders: The NESDA study  featured image

Implicit and explicit self-esteem in current, remitted, recovered, and comorbid depression and anxiety disorders: The NESDA study

Dual processing models of psychopathology emphasize the relevance of differentiating between deliberative self-evaluative processes (explicit self-esteem; ESE) and …

lonneke-a.-van-tuijl
PerFit: An R package for person-fit analysis in IRT featured image

PerFit: An R package for person-fit analysis in IRT

Checking the validity of test scores is important in both educational and psychological measurement. Person-fit analysis provides several statistics that help practitioners …

jorge-n.-tendeiro
Predicting performance in higher education using proximal predictors featured image

Predicting performance in higher education using proximal predictors

We studied the validity of two methods for predicting academic performance and student- program fit that were proximal to important study criteria. Applicants to an undergraduate …

a.-susan-m.-niessen
Derivation and applicability of asymptotic results for multiple subtests person-fit statistics  featured image

Derivation and applicability of asymptotic results for multiple subtests person-fit statistics

In high-stakes testing, it is important to check the validity of individual test scores. Although a test may, in general, result in valid test scores for most test takers, for some …

casper-j.-albers
Individual differences in very young children's English acquisition in China: Internal and external factors featured image

Individual differences in very young children's English acquisition in China: Internal and external factors

This study assesses the impact of internal and external factors on very young EFL learners in an instructional setting. 71 child English learners in China (onset age: 2;0 - 5;6) …

he-sun
Detecting careless respondents in web-based questionnaires: Which method to use? featured image

Detecting careless respondents in web-based questionnaires: Which method to use?

High data quality is an important prerequisite for sound empirical research. Meade and Craig (2012) and Huang, Curran, Keeney, Poposki, and DeShon (2012) discussed methods to …

a.-susan-m.-niessen
A practical guide to check the consistency of item response patterns in clinical research through person-fit statistics: Examples and a computer program featured image

A practical guide to check the consistency of item response patterns in clinical research through person-fit statistics: Examples and a computer program

Although there are many studies devoted to person-fit statistics to detect inconsistent item score patterns, most studies are difficult to understand for nonspecialists. The aim of …

rob-r.-meijer
Person fit assessment using the PerFit package in R featured image

Person fit assessment using the PerFit package in R

The validity of scores derived from an educational or psychological testing situation determines the accuracy and appropriateness of inferences made about an examinee based on …

amin-mousavi
Investigating measurement invariance in computer-based personality testing: The impact of using anchor items on effect size indices  featured image

Investigating measurement invariance in computer-based personality testing: The impact of using anchor items on effect size indices

A popular method to assess measurement invariance of a particular item is based on likelihood ratio tests with all other items as anchor items. The results of this method are often …

iris-j.-l.-egberink
Detection of invalid test scores: The usefulness of simple nonparametric statistics featured image

Detection of invalid test scores: The usefulness of simple nonparametric statistics

In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is …

jorge-n.-tendeiro
Direct transformations yielding the knight's move pattern in $3 \times 3 \times 3$ arrays featured image

Direct transformations yielding the knight's move pattern in $3 \times 3 \times 3$ arrays

Three-way arrays (or tensors) can be regarded as extensions of the traditional two-way data matrices that have a third dimension. Studying algebraic properties of arrays is …

jorge-n.-tendeiro
The probability of exceedance as a nonparametric person-fit statistic for tests of moderate length featured image

The probability of exceedance as a nonparametric person-fit statistic for tests of moderate length

To classify an item score pattern as not fitting a nonparametric item response theory (NIRT) model, the probability of exceedance (PE) of an observed response vector **x** can be …

jorge-n.-tendeiro
Using cumulative sum statistics to detect inconsistencies in unproctored internet testing featured image

Using cumulative sum statistics to detect inconsistencies in unproctored internet testing

Unproctored Internet Testing (UIT) is becoming more popular in personnel recruitment and selection. A drawback of UIT is that cheating is easy and, therefore, a proctored test is …

jorge-n.-tendeiro
The use of the $l_z$ and $l_z^*$ person-fit statistics and problems derived from model misspecification featured image

The use of the $l_z$ and $l_z^*$ person-fit statistics and problems derived from model misspecification

We extend a recent didactic by Magis, Raîche, and Béland on the use of the $l_z$ and $l_z^*$ person-fit statistics. We discuss a number of possibly confusing details and show that …

rob-r.-meijer
A CUSUM to detect person misfit: A discussion and some alternatives for existing procedures featured image

A CUSUM to detect person misfit: A discussion and some alternatives for existing procedures

This article extends the work by Armstrong and Shi on CUmulative SUM (CUSUM) person-fit methodology. The authors present new theoretical considerations concerning the use of CUSUM …

jorge-n.-tendeiro
Some new results on orthogonally constrained Candecomp featured image

Some new results on orthogonally constrained Candecomp

The use of Candecomp to fit scalar products in the context of Indscal is based on the assumption that, due to the symmetry of the data matrices involved, two components matrices …

mohammed-bennani-dosse
First and second-order derivatives for CP and INDSCAL featured image

First and second-order derivatives for CP and INDSCAL

In this paper we provide the means to analyse the second-order differential structure of optimization functions concerning CANDECOMP/PARAFAC and INDSCAL. Closed-form formulas are …

jorge-n.-tendeiro
The link between sufficient conditions by Harshman and by Kruskal for uniqueness in Candecomp/Parafac featured image

The link between sufficient conditions by Harshman and by Kruskal for uniqueness in Candecomp/Parafac

Harshman (UCLA Working Papers in Phonetics 1972; 22: 111-117) has given a proof of uniqueness (identification) of Parafac solutions, when two of the three component matrices are of …

jos-m.-f.-ten-berge
Simplicity transformations for three-way arrays with symmetric slices, and applications to Tucker-3 models with sparse core arrays featured image

Simplicity transformations for three-way arrays with symmetric slices, and applications to Tucker-3 models with sparse core arrays

Tucker three-way PCA and Candecomp/Parafac are two well-known methods of generalizing principal component analysis to three way data. Candecomp/Parafac yields component matrices …

jorge-n.-tendeiro