
Engineer in psycholinguistics and data processing


General information

Reference : UMR7295-THIOLI-001
Workplace : POITIERS
Date of publication : Tuesday, September 03, 2019
Type of Contract : FTC Technical / Administrative
Contract Period : 12 months
Expected date of employment : 1 November 2019
Proportion of work : Full time
Remuneration : Between 2076 and 2184 euros gross monthly, according to professional experience
Desired level of education : 5-year university degree
Experience required : 1 to 4 years


At the CRS, the Centre de Recherche sur la Cognition et l'Apprentissage (University of Poitiers) is seeking an Engineer in psycholinguistics and data processing for a project funded by the Agence Nationale de la Recherche (France):
Pro-TEXT: Processes of textualization: linguistic, psycholinguistic, and machine learning modelling

This research project aims to provide a comprehensive analysis of the textualization process, i.e. the real-time progressive construction of a text. Written texts are produced under complex constraints, which can now be captured by recording the writing process in real time with keystroke logging. Real-time recording of the writing process produces temporal data (e.g., pause length), language data (language sequences produced continuously between two pauses, i.e. bursts of written language), and topological data indicating the location of textual operations in the text.
The main objects of the project are bursts of writing, which are textual segments produced between two pauses as for example in: [pause] une cousine qui [pause] peut venir partager du temps avec elle pendant [pause] le [pause] w [pause] eek [pause] – [pause] end. [pause]
The Pro-TEXT project develops linguistic and psycholinguistic methods and machine-learning tools to provide insight into the relation between regularities of language performance and the cognitive and contextual constraints. The project relies on a corpus of chains of language (bursts) and of pauses and includes more than 30,000 bursts.
The work carried out for the Pro-TEXT project aims to elucidate the dynamics of the textualization process by modelling the relations between the temporal indices of cognitive processes (such as pauses) and the nature of bursts of writing in French and in English-French translations. We argue that the way linguistic sequences are linearly articulated during textualization reflects the multilevel constraints underlying language performance. More specifically, the aims are:
i) to unearth the linguistic strings chosen by writers to build up their texts and the links by which they are interconnected;
ii) to identify the types of sequences that constitute the linguistic material for textualization;
iii) to establish the rules and layout regularities that support their organization in a formally and semantically valid text and the combinatorial strategies used by writers in various contexts and text genres;
iv) to interpret the pauses of production and the bursts of writing by identifying the cognitive processes underlying them and how variations in cognitive demands affect these pauses and bursts, as well as the linguistic forms and functions of bursts.


The selected candidate will process log files to gather the dynamic parameters of pauses and bursts (length, duration), with the aim of examining how the pause preceding a burst and the burst parameters vary with different pause thresholds and under different cognitive constraints. The selected candidate will be in charge of processing the log files and of extracting pauses and bursts after applying the different pause-threshold criteria.
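As an illustration of the pause and burst extraction described above, here is a minimal Python sketch. It assumes a toy log of (timestamp in milliseconds, character) pairs; this is not the actual Inputlog or Scriptlog format. The names `Burst`, `extract_bursts` and `sample_events`, and the threshold values, are hypothetical and chosen only for the example.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Burst:
    preceding_pause_ms: Optional[int]  # None for the first burst (no earlier keystroke)
    text: str                          # characters typed within the burst
    duration_ms: int                   # time from first to last keystroke of the burst


def extract_bursts(events: List[Tuple[int, str]], threshold_ms: int) -> List[Burst]:
    """Split a time-ordered keystroke log into bursts.

    A new burst starts whenever the gap between two consecutive
    keystrokes is at least `threshold_ms` (the pause criterion).
    """
    bursts: List[Burst] = []
    chars: List[str] = []
    start: Optional[int] = None
    prev_t: Optional[int] = None
    pause_before: Optional[int] = None

    for t, ch in events:
        if prev_t is not None and t - prev_t >= threshold_ms:
            # The gap counts as a pause: close the current burst.
            bursts.append(Burst(pause_before, "".join(chars), prev_t - start))
            chars = []
            start = t
            pause_before = t - prev_t
        if start is None:
            start = t
        chars.append(ch)
        prev_t = t

    if chars:
        bursts.append(Burst(pause_before, "".join(chars), prev_t - start))
    return bursts


# Hypothetical toy log: "le " typed quickly, a long pause, then "week".
sample_events = [(0, "l"), (100, "e"), (200, " "),
                 (2500, "w"), (2600, "e"), (2700, "e"), (2800, "k")]

if __name__ == "__main__":
    # Illustrative thresholds only; the project's criteria are not specified here.
    for threshold in (50, 2000):
        result = extract_bursts(sample_events, threshold)
        print(threshold, len(result), [b.text for b in result])
```

Running the same log through several thresholds, as in the loop at the end, shows how the number and length of bursts depend on the pause criterion chosen.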
Relating pauses to bursts is crucial for understanding how a given pause may affect burst characteristics. Starting from the hypothesis that cognitive and linguistic factors affect pause duration and therefore the subsequent bursts, the selected candidate will apply statistical models of pause length and burst characteristics. In particular, we assume that different dimensions of pauses and bursts can be grouped into classes, each class referring to a set of cognitive writing processes. Conversely, we will examine how the length of a specific pause may determine the following burst.

For any questions about the project, candidates are invited to contact the project leader directly by email (see below).


The selected candidate will have to:

1. Review the scientific literature on the study of written production, and in particular the real-time analysis of writing.
2. Participate in data collection.
3. Process writing logs collected via keylogging (Inputlog, Scriptlog, ...).
4. Develop analysis scripts.
5. Perform statistical analyses (distribution analyses, clustering analyses, mathematical and statistical models, parametric and non-parametric regressions).

The recruited research assistant must hold a Master's degree in psychology (cognitive psychology, psycholinguistics) or a Master's degree in linguistics (with a very good knowledge of the psycholinguistics of writing). Statistical skills are required. Programming skills will be welcome.
Knowledge of French is also necessary to work on the categorization of bursts of writing.

Documents to submit in a single PDF file:
- A CV.
- A cover letter explaining how the candidate's skills match the expected work.
- A copy of the Master's thesis and a copy of the Master 2 diploma (if available).
- A summary of previous scientific and research work.

The pre-selected candidates will be invited to present their application in an interview. More detailed information on the modalities of the interview will be sent to the pre-selected candidates.

Work Context

The selected candidate will be hosted by the CeRCA lab at the MSHS de Poitiers and will join the Writing group.

