2

Bridging null hypothesis testing and estimation: A practical guide to statistical conclusion drawing from research in psychology

A well-known problem of null hypothesis significance testing is that it cannot be used to find support for the null hypothesis. A common solution for this is to replace the exact …

henk-a.-l.-kiers

• 24 Jul, 2025 • 1 min read

On a generalizable approach for sample size determination in Bayesian t tests

tsz-keung-wong

• 01 Mar, 2025 • 1 min read

VALID: A Checklist-Based Approach for Improving Validity in Psychological Research

susanne-kerschbaumer

• 01 Jan, 2025 • 1 min read

Introduction to the Bayes factor: A Shiny/R app

jorge-n.-tendeiro

• 01 Jan, 2025 • 1 min read

Data-driven prior elicitation for Bayes factors in Cox regression for nine subfields in biomedicine

maximilian-linde

• 01 Jan, 2025 • 1 min read

Bayes factors for two-group comparisons in Cox regression with an application for reverse-engineering raw data from summary statistics

maximilian-linde

• 01 Jan, 2025 • 1 min read

Assessment of fit of the time-varying dynamic partial credit model using the posterior predictive model checking method

sebastian-castro-alvarez

• 01 Nov, 2024 • 1 min read

Practical Implications of Equating Equivalence Tests: Reply to Campbell and Gustafson (2022)

maximilian-linde

• 01 Jun, 2024 • 1 min read

``Adding an egg' in algorithmic decision making: improving stakeholder and user perceptions, and predictive validity by enhancing autonomy

marvin-neumann

• 01 May, 2024 • 1 min read

A Time-Varying Dynamic Partial Credit Model to Analyze Polytomous and Multivariate Time Series Data

sebastian-castro-alvarez

• 01 Jan, 2024 • 1 min read

Performance of Nonparametric Person-Fit Statistics with Unfolding versus Dominance Response Models

jennifer-reimers

• 01 Oct, 2023 • 1 min read

The Effects of Aberrant Responding on Model-Fit Assuming Different Underlying Response Processes

jennifer-reimers

• 01 Sep, 2023 • 1 min read

Decisions About Equivalence: A Comparison of TOST, HDI-ROPE, and the Bayes Factor

maximilian-linde

• 01 Jun, 2023 • 1 min read

A Review of Applications of the Bayes Factor in Psychological Research

daniel-w.-heck

• 01 Jun, 2023 • 1 min read

With Bayesian estimation one can get all that Bayes factors offer, and more

jorge-n.-tendeiro

• 01 Apr, 2023 • 1 min read

Model-data fit evaluation: Aberrant response detection

Person-fit analysis is an important field aimed at establishing the validity (or lack thereof) of the response pattern provided by each respondent of a test or questionnaire. This …

jorge-n.-tendeiro

• 01 Jan, 2023 • 1 min read

The autonomy-validity dilemma in mechanical prediction procedures: The quest for a compromise

marvin-neumann

• 01 Oct, 2022 • 1 min read

Does Functional Somatic Symptoms measurement differ across Sex and Age? Differential Item Functioning in Somatic Symptoms measured with the CIDI

Functional Somatic Symptoms (FSS) are physical symptoms that cannot be attributed to underlying pathology. Their severity is often measured with sum-scores on questionnaires; …

angelica-acevedo-mesa

• 01 Oct, 2022 • 1 min read

On the potential mismatch between the function of the Bayes factor and researchers’ expectations

The aim of this study is to investigate whether there is a potential mismatch between the usability of a statistical tool and psychology researchers' expectation of it. Bayesian …

tsz-keung-wong

• 01 Jun, 2022 • 1 min read

Mixed-Effects Trait-State-Occasion Model: Studying the Psychometric Properties and the Person-Situation Interactions of Psychological Dynamics

The trait-state-occasion model (TSO) is a popular model within the latent state-trait theory (LST). The TSO allows distinguishing the trait and the state components of the …

sebastian-castro-alvarez

• 01 May, 2022 • 1 min read

On the white, the black, and the many shades of gray in between: Our reply to van Ravenzwaaij and Wagenmakers (2021)

In 2019 we wrote a paper (Tendeiro & Kiers, 2019) in Psychological Methods over null hypothesis Bayesian testing and its working horse, the Bayes factor. Recently, van Ravenzwaaij …

jorge-n.-tendeiro

• 01 Apr, 2022 • 1 min read

Using Structural Equation Modeling to Study Traits and States in Intensive Longitudinal Data

Traditionally, researchers have used time series and multilevel models to analyze intensive longitudinal data. However, these models do not directly address traits and states …

sebastian-castro-alvarez

• 16 Jan, 2022 • 1 min read

Education Increases Decision-rule Use: An Investigation of Education and Incentives to Improve Decision Making

Robust scientific evidence shows that human performance predictions are more valid when information is combined mechanically (with a decision rule) rather than holistically (in …

marvin-neumann

• 12 Jan, 2022 • 1 min read

Worked-out examples of the adequacy of Bayesian optional stopping

The practice of sequentially testing a null hypothesis as data are collected until the null hypothesis is rejected is known as optional stopping. It is well-known that optional …

jorge-n.-tendeiro

• 01 Jan, 2022 • 1 min read

The Crit Coefficient in Mokken Scale Analysis: A Simulation Study and an Application in Quality-of-Life Research

Purpose: In Mokken scaling, the Crit index was proposed and is sometimes used as evidence (or lack thereof) of violations of some common model assumptions. The main goal of our …

daniela-r.-crisan

• 01 Jan, 2022 • 1 min read

Seven steps toward more transparency in statistical practice

We argue that statistical practice in the social and behavioural sciences benefits from transparency, a fair acknowledgement of uncertainty and openness to alternative …

eric-jan-wagenmakers

• 01 Sep, 2021 • 1 min read

Improving the measurement of functional somatic symptoms with Item Response Theory

More than 40 questionnaires have been developed to assess functional somatic symptoms (FSS), but there are several methodological issues regarding the measurement of FSS. We aimed …

angelica-acevedo-mesa

• 06 Aug, 2020 • 1 min read

On the practical consequences of misfit in Mokken scaling

Mokken scale analysis is a popular method to evaluate the psychometric quality of clinical and personality questionnaires and their individual items. Although many empirical …

daniela-r.-crisan

• 01 Jan, 2020 • 1 min read

A review of issues about null hypothesis Bayesian testing

Null hypothesis significance testing (NHST) has been under scrutiny for decades. The literature shows overwhelming evidence of a large range of problems affecting NHST. One of the …

jorge-n.-tendeiro

• 01 Dec, 2019 • 1 min read

Guilt in bereavement: Its relationship with complicated grief and depression

This study investigated the relationship between guilt and well-being of bereaved persons, and explored potential differences in the associations between guilt-complicated grief …

jie-li

• 05 Jul, 2019 • 1 min read

Practical consequences of model misfit when using rating scales to assess the severity of attention problems in children

In this study, we examined the consequences of ignoring violations of assumptions underlying the use of sum scores in assessing attention problems (AP) and if psychometrically …

daniela-r.-crisan

• 01 Jul, 2019 • 1 min read

Gender-based differential prediction by curriculum samples for college admissions

A longstanding concern about admissions to higher education is the underprediction of female academic performance by admission test scores. One explanation for these findings is …

a.-susan-m.-niessen

• 03 Jun, 2019 • 1 min read

Bayes factors for superiority, non-inferiority, and equivalence designs

In clinical trials, study designs may focus on assessment of superiority, equivalence, or non-inferiority, of a new medicine or treatment as compared to a control. Typically, …

don-van-ravenzwaaij

• 29 Mar, 2019 • 1 min read

GGUM: An R package for fitting the generalized graded unfolding model

In this article, the newly created GGUM R package is presented. This package finally brings the generalized graded unfolding model (GGUM) to the front stage for practitioners and …

jorge-n.-tendeiro

• 01 Mar, 2019 • 1 min read

What are the minimal sample size requirements for Mokken scaling? An empirical example with the Warwick-Edinburgh Mental Well-Being Scale

Sample size in Mokken scales is mostly studied on simulated data, reflected in the lack of consideration of sample size in most Mokken scaling studies. Recently, [Straat, J. H., …

roger-watson

• 08 Aug, 2018 • 1 min read

Admission testing for higher education: A multi-cohort study on the validity of high-fidelity curriculum-sampling tests

We investigated the validity of curriculum-sampling tests for admission to higher education in two studies. Curriculum-sampling tests mimic representative parts of an academic …

a.-susan-m.-niessen

• 11 Jun, 2018 • 1 min read

Corrigendum: The use of subscores in higher education: When is this useful?

A corrigendum on *The Use of Subscores in Higher Education: When Is This Useful?*, by Meijer, R. R., Boevé, A. J., Tendeiro, J. N., Bosker, R. J., and Albers, C. J. (2017). Front. …

rob-r.-meijer

• 28 May, 2018 • 1 min read

Identifying levels of general distress in first line mental health services: can GP- and eHealth clients' scores be meaningfully compared?

The Four-Dimensional Symptom Questionnaire (4DSQ) (Huisarts Wetenschap 39: 538–47, 1996) is a self-report questionnaire developed in the Netherlands to distinguish non-specific …

jan-van-bebber

• 01 Dec, 2017 • 1 min read

Investigating the practical consequences of model misfit in unidimensional IRT models

In this article, the *practical* consequences of violations of unidimensionality on selection decisions in the framework of unidimensional item response theory (IRT) models are …

daniela-r.-crisan

• 01 Sep, 2017 • 1 min read

The use of subscores in higher education: When is this useful?

Assessment in higher education is challenging because teachers face more students, with less contact time as compared to primary and secondary education. Therefore, teachers and …

rob-r.-meijer

• 07 Mar, 2017 • 1 min read

Applying organizational justice theory to admission into higher education: Admission from a student perspective

Applicant perceptions of methods used in admission procedures to higher education were investigated using organizational justice theory. Applicants to a psychology study program …

a.-susan-m.-niessen

• 07 Feb, 2017 • 1 min read

Measuring non-cognitive predictors in high-stakes contexts: The effect of self-presentation on self-report instruments used in admission to higher education

Non-cognitive constructs such as personality traits and behavioral tendencies show predictive validity for academic performance and incremental validity over and above cognitive …

a.-susan-m.-niessen

• 01 Feb, 2017 • 1 min read

$The $l^*_{z(p)}$ person-fit statistic in an unfolding model context featured image$

The $l^*_{z(p)}$ person-fit statistic in an unfolding model context

Although person-fit analysis has a long-standing tradition within item response theory, it has been applied in combination with dominance response models almost exclusively. In …

jorge-n.-tendeiro

• 01 Jan, 2017 • 1 min read

Implicit and explicit self-esteem in current, remitted, recovered, and comorbid depression and anxiety disorders: The NESDA study

Dual processing models of psychopathology emphasize the relevance of differentiating between deliberative self-evaluative processes (explicit self-esteem; ESE) and …

lonneke-a.-van-tuijl

• 15 Nov, 2016 • 1 min read

PerFit: An R package for person-fit analysis in IRT

Checking the validity of test scores is important in both educational and psychological measurement. Person-fit analysis provides several statistics that help practitioners …

jorge-n.-tendeiro

• 20 Oct, 2016 • 1 min read

Predicting performance in higher education using proximal predictors

We studied the validity of two methods for predicting academic performance and student- program fit that were proximal to important study criteria. Applicants to an undergraduate …

a.-susan-m.-niessen

• 01 Jun, 2016 • 1 min read

Derivation and applicability of asymptotic results for multiple subtests person-fit statistics

In high-stakes testing, it is important to check the validity of individual test scores. Although a test may, in general, result in valid test scores for most test takers, for some …

casper-j.-albers

• 01 Jun, 2016 • 1 min read

Individual differences in very young children's English acquisition in China: Internal and external factors

This study assesses the impact of internal and external factors on very young EFL learners in an instructional setting. 71 child English learners in China (onset age: 2;0 - 5;6) …

he-sun

• 01 May, 2016 • 1 min read

Detecting careless respondents in web-based questionnaires: Which method to use?

High data quality is an important prerequisite for sound empirical research. Meade and Craig (2012) and Huang, Curran, Keeney, Poposki, and DeShon (2012) discussed methods to …

a.-susan-m.-niessen

• 30 Apr, 2016 • 1 min read

A practical guide to check the consistency of item response patterns in clinical research through person-fit statistics: Examples and a computer program

Although there are many studies devoted to person-fit statistics to detect inconsistent item score patterns, most studies are difficult to understand for nonspecialists. The aim of …

rob-r.-meijer

• 01 Feb, 2016 • 1 min read

Person fit assessment using the PerFit package in R

The validity of scores derived from an educational or psychological testing situation determines the accuracy and appropriateness of inferences made about an examinee based on …

amin-mousavi

• 01 Jan, 2016 • 1 min read

Investigating measurement invariance in computer-based personality testing: The impact of using anchor items on effect size indices

A popular method to assess measurement invariance of a particular item is based on likelihood ratio tests with all other items as anchor items. The results of this method are often …

iris-j.-l.-egberink

• 01 Feb, 2015 • 1 min read

Detection of invalid test scores: The usefulness of simple nonparametric statistics

In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is …

jorge-n.-tendeiro

• 28 Aug, 2014 • 1 min read

$Direct transformations yielding the knight's move pattern in $3 \times 3 \times 3$ arrays featured image$

Direct transformations yielding the knight's move pattern in $3 \times 3 \times 3$ arrays

Three-way arrays (or tensors) can be regarded as extensions of the traditional two-way data matrices that have a third dimension. Studying algebraic properties of arrays is …

jorge-n.-tendeiro

• 15 Nov, 2013 • 1 min read

The probability of exceedance as a nonparametric person-fit statistic for tests of moderate length

To classify an item score pattern as not fitting a nonparametric item response theory (NIRT) model, the probability of exceedance (PE) of an observed response vector **x** can be …

jorge-n.-tendeiro

• 27 Aug, 2013 • 1 min read

Using cumulative sum statistics to detect inconsistencies in unproctored internet testing

Unproctored Internet Testing (UIT) is becoming more popular in personnel recruitment and selection. A drawback of UIT is that cheating is easy and, therefore, a proctored test is …

jorge-n.-tendeiro

• 01 Feb, 2013 • 1 min read

The use of the $l_z$ and $l_z^*$ person-fit statistics and problems derived from model misspecification

We extend a recent didactic by Magis, Raîche, and Béland on the use of the $l_z$ and $l_z^*$ person-fit statistics. We discuss a number of possibly confusing details and show that …

rob-r.-meijer

• 01 Dec, 2012 • 1 min read

A CUSUM to detect person misfit: A discussion and some alternatives for existing procedures

This article extends the work by Armstrong and Shi on CUmulative SUM (CUSUM) person-fit methodology. The authors present new theoretical considerations concerning the use of CUSUM …

jorge-n.-tendeiro

• 18 Jun, 2012 • 1 min read

Some new results on orthogonally constrained Candecomp

The use of Candecomp to fit scalar products in the context of Indscal is based on the assumption that, due to the symmetry of the data matrices involved, two components matrices …

mohammed-bennani-dosse

• 01 Jul, 2011 • 1 min read

First and second-order derivatives for CP and INDSCAL

In this paper we provide the means to analyse the second-order differential structure of optimization functions concerning CANDECOMP/PARAFAC and INDSCAL. Closed-form formulas are …

jorge-n.-tendeiro

• 15 Mar, 2011 • 1 min read

The link between sufficient conditions by Harshman and by Kruskal for uniqueness in Candecomp/Parafac

Harshman (UCLA Working Papers in Phonetics 1972; 22: 111-117) has given a proof of uniqueness (identification) of Parafac solutions, when two of the three component matrices are of …

jos-m.-f.-ten-berge

• 01 Jul, 2009 • 1 min read

Simplicity transformations for three-way arrays with symmetric slices, and applications to Tucker-3 models with sparse core arrays

Tucker three-way PCA and Candecomp/Parafac are two well-known methods of generalizing principal component analysis to three way data. Candecomp/Parafac yields component matrices …

jorge-n.-tendeiro

• 01 Feb, 2009 • 1 min read