Abstract

Web-based and machine learning approaches for identification of patient-reported outcomes in inflammatory bowel disease

Dig Liver Dis. 2022 Apr;54(4):483-489.doi: 10.1016/j.dld.2021.09.005. Epub 2021 Sep 26.

 

Laetitia Ricci 1Yannick Toussaint 2Justine Becker 3Hiba Najjar 4Alix Renier 5Myriam Choukour 6Anne Buisson 7Corinne Devos 7Jonathan Epstein 8Laurent Peyrin Biroulet 9Francis Guillemin 10

 
     

Author information

1CHRU-Nancy, INSERM, Université de Lorraine, CIC 1433 Clinical Epidemiology, F-54000 Nancy, France. Electronic address: l.ricci@chru-nancy.fr.

2Laboratoire lorrain de recherche en informatique et ses applications, Université de Lorraine, Nancy, France. Electronic address: yannick.toussaint@loria.fr.

3Ecole des mines de Nancy, Université de Lorraine, Nancy, France. Electronic address: justine.becker5@etu.univ-lorraine.fr.

4Ecole des mines de Nancy, Université de Lorraine, Nancy, France. Electronic address: hiba.najjar@etu.mines-nancy.univ-lorraine.fr.

5Ecole des mines de Nancy, Université de Lorraine, Nancy, France. Electronic address: alix.renier@etu.mines-nancy.univ-lorraine.fr.

6INSERM, U1256 NGERE and gastroenterology Department, CHRU-Nancy, Université de Lorraine, Nancy, France. Electronic address: m.choukour@chru-nancy.fr.

7afa Crohn RCH, France.

8CHRU-Nancy, INSERM, Université de Lorraine, CIC 1433 Clinical Epidemiology, F-54000 Nancy, France; Université de Lorraine, APEMAC, F-54000 Nancy, France. Electronic address: j.epstein@chru-nancy.fr.

9INSERM, U1256 NGERE and gastroenterology Department, CHRU-Nancy, Université de Lorraine, Nancy, France.

10CHRU-Nancy, INSERM, Université de Lorraine, CIC 1433 Clinical Epidemiology, F-54000 Nancy, France; Université de Lorraine, APEMAC, F-54000 Nancy, France. Electronic address: francis.guillemin@chru-nancy.fr.

Abstract

Background: Messages from an Internet forum are raw material that emerges in a natural setting (i.e., non-induced by a research situation).

Aims: The FLARE-IBD project aimed at using an innovative approach consisting of collecting messages posted by patients in an Internet forum and conducting a machine-learning study (data analysis/language processing) for developing a patient-reported outcome measuring flare in inflammatory bowel disease meeting international requirements.

Methods: We used web-based and machine learning approaches, in the following steps. 1) Web-scraping to collect all available posts in an Internet forum (23 656 messages) and extracting metadata from the forum. 2) Twenty patients were randomly assigned 50 extracted messages; participants indicated whether the message corresponded or not to the flare phenomenon (labeling). If yes, participants were asked to identify excerpts from the text they considered significant flare markers (annotation). 3) The set of annotated messages underwent a vocabulary analysis.

Results: The phenomenon of flare was circumscribed with the identification of 20 surrogate flare markers classified into five dimensions with their frequency within extracted labeled data: impact on life, symptoms, extra-intestinal manifestations, drugs and environmental factors. Web-based and machine-learning approaches met international recommendations to inform the content and structure for the development of patient-reported outcomes.

© Copyright 2013-2022 GI Health Foundation. All rights reserved.
This site is maintained as an educational resource for US healthcare providers only. Use of this website is governed by the GIHF terms of use and privacy statement.