Prof. Dr. Steffi Pohl

Methods and Evaluation / Quality Assurance
Professor
Room KL 23/226
14195 Berlin
Office hours
By arrangement.
Professional career
- Since October 2019: Full Professor of Methods and Evaluation/Quality Assurance, Department of Education and Psychology, Freie Universität Berlin
- 2013-2019: Assistant Professor of Methods and Evaluation/Quality Assurance, Department of Education and Psychology, Freie Universität Berlin
- 2009-2013: Research Associate at the department of Psychology focusing on Methods of Empirical Education Research (Dr. Claus Carstensen, Professor), Project: National Education Panel Study (NEPS), Otto-Friedrich-Universität Bamberg
- 2005-2009: Research Associate at the Department of Methodology and Evaluation Research (Dr. Rolf Steyer, Professor), Friedrich-Schiller Universität Jena
- 2007: Research Associate in the project kompetenztest.de (Dr. Christof Nachtigall), Friedrich-Schiller-Universität Jena
- 2004-2005: Research Associate in the e-learning project SPSSinteraktiv (Dr. Renate Soellner, Professor), Freie Universität Berlin
- 2004: Research Associate (substitute) at the Methods of Psychology Division (Dr. Albrecht Iseler, Professor), Freie Universität Berlin
Education
- 2005-2010: Doctorate, Friedrich-Schiller-Universität Jena (advisor: Rolf Steyer)
- 2002-2003: Study program “MSc Research Methods in Psychology”, University of Hertfordshire, UK
- 1998-2004: Study program toward the degree of Diplom in Psychology at Freie Universität Berlin
Awards and Distinctions
- 2020 Early Career Award of the Psychometric Society
- 2011: Gustav A. Lienert Dissertation Award from the German Psychological Association for Methodology and Evaluation
Functions
- since 2021 Professorial member of the Academic Senate of Freie Universität Berlin
- since 2020 Member of the Editorial Council of the Psychometric Society
- 2018 - 2020 Member of the Executive Committee in the European Association of Methodology
- 2017 - 2019 Speaker of the German Psychological Association for Methodology and Evaluation
- 2011 - 2017 Board member of the German Psychological Association for Methodology and Evaluation
Editorial Activities
- since 2018 Associate Editor of Psychometrika
- 2016-2019 Associate Editor of the British Journal of Mathematical and Statistical Psychology
- since 2016 Editorial Board of Zeitschrift für Psychologie
Memberships
Current Lectures in 2024/2025, Winter Semester
- Lecture: Methods of empirical social research
- Seminar: Quantitative Research
- Lecture: Introduction to Psychology - Research Methods
- Seminar: Specific methods of multivariate research
- Research colloquium 
Research Interests
- Psychometrics
- Log data analysis
- Missing values
- Causal inference
Research Projects can be found here.
Publications
Journal Articles (peer-reviewed)
- 
- Noack L., Taubner R.-S., Pohl, S., Pinna S., Robert S., Foucher F., de Vera J.-P. P. (in press). Exploring the Diversity within EANA and AbGradE as Mirror of the European Astrobiology Community. International Journal of Astrobiology.
- 
Hoffmann, J., Twardawski, M., Höhs, J., Gast, A., Pohl, S., & Sengewald, M.-A. (in press). The 
 design of current replication studies: A systematic literature review on the variation of study
 characteristics. Advances in Methods and Practices in Psychological Science.
- Alfers, T., Gittler, G., Ulitzsch, E., & Pohl, S. (2024). Assessing the speed–accuracy tradeoff in psychological testing using experimental manipulations. Educational and Psychological Measurement. https://doi.org/10.1177/00131644241271309
- 
Engelke, L., Calvano, C., Pohl, S., Winter, S.M., & Renneberg, B. (in press). Parental mental health 
 and child maltreatment in the COVID-19 Pandemic: the sampling matters. Journal of Medical
 Internet Research, 27. e52043. https://doi.org/10.2196/52043
- 
Daehn, D., Meyer, C., Loew, V., Wabiszczewicz, J., Pohl, P., Böttche, M., Pawils, S., Renneberg, B. (2024). Smartphone-based intervention for postpartum depressive symptoms (Smart-e-Moms): Study protocol for a randomized controlled trial. DOI: 10.1186/s13063-024-08304-5 . 
- Domingue, B.W., Kanopka, K., Kapoor, R., Pohl, S., Chalmers, R.P., Rahal, C., & Rhemtulla,
 M. (2024). The InterModel Vigorish as a lens for understanding (and quantifying) the value
 of item response models for dichotomously coded items. Psychometrika, 89(3), 1034-1054.
 https://doi.org/10.1007/s11336-024-09977-2
- Mutak, A., Krause, R., Ulitzsch, E., Much, S., Ranger, J., & Pohl, S. (2024). Modeling the
 intraindividual relation of ability and speed within a test. Journal of Educational Measurement,
 61(3). 378-407. https://doi.org/10.1111/jedm.12391
- Ulitzsch, E., Zhang, S., & Pohl, S. (2024). A model-based approach to the disentanglement and differential treatment of engaged and disengaged item omissions. Multivariate Behavioral Research, 59(3), 599–619. https://doi.org/10.1080/00273171.2024.2307518
- Ulitzsch, E., Pohl, S., Khorramdel, L., Kroehne, U., & von Davier, M. (2023). Using response times for joint modeling of careless responding and attentive response styles. Journal of Educational and Behavioral Statistics, 49(2), 173-206. https://doi.org/10.3102/10769986231173607
- Ranger, J., Wolgast, A., Much, S., Mutak, A., Krause, R., & Pohl, S. (2023). Disentangling different aspects of change in tests with the D-Diffusion model. Multivariate Behavioral Research 58(5). https://doi.org/10.1080/00273171.2023.2171356
- Schulze, D., Reuter, B., & Pohl, S. (2022). Measurement invariance: Dealing with the uncertainty in anchor item choice by model averaging. Structural Equation Modeling, 29(4), 521–530. https://doi.org/10.1080/10705511.2021.2012785
- Ulitzsch, E. Pohl, S., Khorramdel, L., Kroehne, U., & von Davier, M. (2022). A response-time-based latent response mixture model for identifying and modeling careless and insufficient effort responding in survey data. Psychometrika, 87(2), 593–619. https://doi.org/10.1007/s11336-021-09817-7
- Ulitzsch, E., He, Q., & Pohl, S. (2022). Using sequence mining techniques for understanding incorrect behavioral patterns on interactive tasks. Journal of Educational and Behavioral Statistics, 47(1), 3-35. doi: 10.3102/10769986211010467
- Pohl, S., Schulze, D., & Stets, E. (2021). Partial measurement invariance: Extending and evaluating the cluster approach for identifying anchor items. Applied Psychological Measurement, 45(7-8), 477-493. https://doi.org/10.1177/01466216211042809
- Pohl, S., Ulitzsch, E., & von Davier, M. (2021). Reframing country rankings in educational assessments. Science, 372(6540), 338-340. doi:10.1126/science.abd3300
- Ranger, J., Kuhn, T., & Pohl, S. (2021). Effects of motivation on the accuracy and speed of responding in tests: The speed-accuracy tradeoff revisited. Measurement: Interdisciplinary Research and Perspectives, 19(1), 15-38. doi: 10.1080/15366367.2020.1750934
- Ulitzsch, E., He, Q., Ulitzsch, V., Molter, H., Nichterlein, A., Niedermeier, R., & Pohl, S. (2021). Combining Clickstream Analyses and Graph-Modeled Data Clustering for Identifying Common Response Processes. Psychometrika, 86, 190-214. doi: 10.1007/s11336-020-09743-0
- Ulitzsch, E., Penk, C., von Davier, M., & Pohl, S. (2021). Model meets reality: Validating a new behavioral measure for test-taking effort. Educational Assessment,26(2), 104-124. doi: 10.1080/10627197.2020.1858786
- Maia, D. D. A., Pohl, S., Okuda, P. M. M., Liu, T., Publisi, M. L., Ploubidis, G., Eid, M., Cogo-Moreira, H. (2020). Psychometric properties and optimizing of the Bracken School Readiness Assessment. Educational Assessment, Evaluation and Accountability, 1-13. doi: 10.1007/s11092-020-09339-3
- Schulze, D., & Pohl, S. (2021). Finding clusters of measurement invariant items for continuous covariates. Structural Equation Modeling: A Multidisciplinary Journal, 1-10. doi: 10.1080/10705511.2020.1771186
- Pohl, S., & Schulze, D. (2020). Assessing group comparisons or change over time under measurement non-invariance: The cluster approach for nonuniform DIF. Psychological Test and Assessment Modeling, 62(2), 281-303.
- Pohl, S. & Becker, B. (2020). Performance of missing data approaches under nonignorable missing data conditions. Methodology, 16(2), 147-165. doi: 10.5964/meth.2805
- Ulitzsch, E., von Davier, M., & Pohl, S. (2019). A hierarchical latent response model for inferences about examinee engagement in terms of guessing and item-level nonresponse. British Journal of Mathematical and Statistical Psychology. 73, 83-112. doi.org/10.1111/bmsp.12188
- Ulitzsch, E., von Davier, M., & Pohl, S. (2020). A Multiprocess item response model for not-reached items due to time limits and quitting. Educational and Psychological Measurement. 80(3), 522-547. doi:10.1177/0013164419878241
- Ulitzsch, E., von Davier, M., & Pohl, S. (2020). Using response times for joint modeling of response and omission behavior. Multivariate Behavioral Research, 55(3), 425-453. doi:10.1080/00273171.2019.1643699
- Schwabe, I., Gu, Z., Tijmstra, J., Hatemi, P., & Pohl, S. (2019). Psychometric modelling of longitudinal genetically-informative twin data. Frontiers in Genetics, 10, 837. doi:10.3389/fgene.2019.00837
- Pohl, S., Ulitzsch, E., & von Davier, M. (2019). Using response times to model not-reached items due to time limits.Psychometrika, 84(3), 892-920. doi:10.1007/s11336-019-09669-2
- Sengewald, M.-A., & Pohl, S. (2019). Compensation and amplification of attenuation bias in causal effect estimates. Psychometrika, 84(2), 589-610. doi:10.1007/s11336-019-09665-6
- Sachse, K., Mahler, N., & Pohl, S. (2018). When nonresponse mechanisms change: Effects on trends and group comparisons in international large-scale assessments. Educational and Psychological Measurement, 79(4), 699-726. doi:10.1177/0013164419829196
- Pohl, S. & von Davier, M. (2018) Commentary: On the importance of the speed-ability trade-off when dealing with not reached items. Frontiers in Psychology, 9:1988. doi: 10.3389/fpsyg.2018.01988
- Sengewald, M.-A., Steiner, P. M., & Pohl, S. (2019). When does measurement error in covariates impact causal effect estimates? - Analytical derivations of different scenarios and an empirical illustration. British Journal of Mathematical and Statistical Psychology, 72(2), 244-270. doi: 10.1111/bmsp.12146
- Köhler, C., Pohl, S., & Carstensen, C. H. (2017). Dealing with item nonresponse in large-scale cognitive assessments: The impact of missing data methods on estimated explanatory relationships. Journal of Educational Measurement, 54(4), 397-419. doi:10.1111/jedm.12154
- Haberkorn, K., Pohl, S., & Carstensen, C. H. (2016). Incorporating different response formats of competence tests in an IRT model. Psychological Test and Assessment Modeling, 58(2), 223-252.
- Pohl, S., Südkamp, A., Hardt, K., Carstensen, C. H., & Weinert, S. (2016). Testing Students with Special Educational Needs in Large-Scale Assessments: Psychometric Properties of Test Scores and Associations with Test Taking Behavior. Frontiers in Psychology, 154(7), 1-14. doi: 10.3389/fpsyg.2016.00154
- Aßmann, C., Gaasch, C., Pohl, S., & Carstensen, C. H. (2015). Bayesian estimation in IRT models with missing values in background variables. Psychological Test and Assessment Modeling, 57(4), 595-618.
- Köhler, C., Pohl, S., & Carstensen, C. H. (2015). Investigating mechanisms for missing responses in competence tests. Psychological Test and Assessment Modeling, 57(4), 499-522.
- Südkamp, A., Pohl, S., & Weinert, S. (2015). Competence assessment of students with special educational needs: Identification of appropriate testing accommodations. Frontline Learning Research, 3 (2), 1-25. doi:10.14786/flr.v3i2.130
- Köhler, C., Pohl, S., & Carstensen, C. H. (2015). Taking the missing propensity into account when estimating competence scores: Evaluation of IRT models for non-ignorable omissions. Educational and Psychological Measurement, 75(5), 850-875. doi:10.1177/0013164414561785
- Haberkorn, K., Lockl, K., Pohl, S., Ebert, S., & Weinert, S. (2014). Metacognitive knowledge in children at early elementary school. Metacognition and Learning, 9(3), 239-263. doi:10.1007/s11409-014-9115-1
- Pohl, S., Gräfe, L., & Rose, N. (2014). Dealing with omitted and not reached items in competence tests - Evaluating approaches accounting for missing responses in IRT models. Educational and Psychological Measurement, 74(3), 423-452. doi:10.1177/0013164413504926
- Pohl, S. (2013). Longitudinal multi-stage testing. Journal of Educational Measurement, 50(4), 447-468. doi: 10.1111/jedm.12028
- Pohl, S., & Carstensen, C. (2013). Scaling the competence tests in the National Educational Panel Study – Many questions, some answers, and further challenges. Journal for Educational Research Online, 5(2), 189-216.
- Raykov, T., & Pohl, S. (2013). Essential unidimensionality examination for multi-component scales: An interrelationship decomposition approach. Educational and Psychological Measurement, 73(4), 581-600. doi:10.1177/0013164412470451
- Raykov, T., & Pohl, S. (2013). On studying common factor variance in multiple component measuring instruments. Educational and Psychological Measurement, 73(2), 191-209. doi:10.1177/0013164412458673
- Cook, T. D., Pohl, S., & Steiner, P. M. (2011). Die relative Bedeutung der Kovariatenwahl, Reliabilität und Art der Datenanalyse für die Schätzung kausaler Effekte aus Beobachtungsdaten. Zeitschrift für Evaluation, 10(2), 203-224.
- Cook, T. D., Steiner, P. M., & Pohl, S. (2009). How bias reduction is affected by covariate choice, unreliability, and mode of data analysis: Results from two types of within-study Comparisons. Multivariate Behavioral Research, 44(6), 828-847. doi:10.1080/00273170903333673
- Pohl, S., & Steyer, R. (2010). Modeling common traits and method effects in multitrait-multimethod analysis. Multivariate Behavioral Research, 45(1), 45-72. doi:10.1080/00273170903504729
- Pohl, S., Steiner, P. M., Eisermann, J., Soellner, R., & Cook, T. D. (2009). Unbiased causal inference from an observational study: Results of a within-study comparison. Educational Evaluation and Policy Analysis, 31(4), 463-479. doi:10.3102/0162373709343964
- Vautier, S., & Pohl, S. (2009). Do balanced scales assess bipolar constructs? The case of the STAI scales. Psychological Assessment, 21(2), 187-193. doi: 10.1037/a0015312
- Pohl, S., Steyer, R., & Kraus, K. (2008). Modelling method effects as individual causal effects. Journal of the Royal Statistical Society, Series A, 171(1), 41-63. doi:10.1111/j.1467-985X.2007.00517.x
 Technical Reports- Krannich, M., Jost, O., Rohm, T., Koller, I., Pohl, S., Haberkorn, K., Carstensen, C. H., Fischer, L., & Gnambs, T. (2017). NEPS Technical Report for reading: Scaling results of Starting Cohort 3 for grade 7 (NEPS Survey Papers No. 14). Bamberg, Germany: Leibniz Institute for Educational Trajectories, National Educational Panel Study.
- Pohl, S., Stets, E., & Carstensen, C. H. (2017). Cluster-based anchor item identification and selection (NEPS Working Paper No. 68). Bamberg: Leibniz Institute for Educational Trajectories, National Educational Panel Study.
- Aßmann, C., Carstensen, C. H., Gaasch, C., & Pohl, S. (2014). Estimation of plausible values using background variables with missing values: A data augmented MCMC approach. (NEPS Working Paper No. 38). Bamberg: Leibniz Institute for Educational Trajectories, National Educational Panel Study.
- Pohl, S., Haberkorn, K.,& Hardt, K., (2014).NEPS Technical Report for Reading – Scaling results of starting cohort 5 for first-year students in main study 2010/11. (NEPS Working Paper No. 34). Bamberg: Leibniz Institute for Educational Trajectories, National Educational Panel Study.
- Hardt, K., Pohl, S. & Haberkorn, K. (2013).NEPS Technical Report for Reading – Scaling results of Starting Cohort 6 for adults in main study 2010/11 (NEPS Working Paper No. 25). Bamberg: Otto-Friedrich-Universität, Nationales Bildungspanel.
- Haberkorn, K., Pohl, S., Hardt, K., & Wiegand, E. (2012).NEPS Technical Report for Reading – Scaling results of starting cohort 4 in ninth grade (NEPS Working Paper No. 16). Bamberg: Otto-Friedrich-University, Nationales Bildungspanel.
- Pohl, S., Haberkorn, K., Hardt, K., & Wiegand, E. (2012).NEPS Technical Report for Reading – Scaling results of starting cohort 3 in fifth grade (NEPS Working Paper No. 15). Bamberg: Otto-Friedrich-University, Nationales Bildungspanel.
- Pohl, S., & Carstensen, C. (2012). NEPS Technical Report – Scaling the data of the competence tests (NEPS Working Paper No. 14). Bamberg: Otto-Friedrich-University, Nationales Bildungspanel.