Aims
Barrier healing is emerging as a key therapeutic target in Inflammatory bowel disease (IBD), being more closely associated with sustained remission and improved long-term outcomes than endoscopic and histological remission alone. Accurate assessment of barrier integrity is therefore crucial to optimise disease management. Probe-based confocal laser endomicroscopy (pCLE) enables real-time structural and functional assessment of intestinal barrier. However, its interpretation remains complex and highly operator-dependent, limiting its reproducibility and widespread application. Preliminary studies have demonstrated the feasibility of using automated assessment of specific pCLE features to predict therapeutic response1. We aimed to develop and validate an artificial intelligence (AI)-driven pCLE system for standardised, objective assessment of barrier impairment (BI) in IBD.
Methods
156 high-quality pCLE videos from IBD patients undergoing colonoscopy in Ireland and UK were considered. Experienced endoscopists assessed the presence of BI according to a previously developed pCLE-scoring system2, providing a reference standard for AI. An AI algorithm was developed to predict pCLE-based BI using a weakly supervised learning paradigm that exclusively employs video-level scoring. Frame-level features were extracted with ViT pretrained under DINOv2 self-supervised learning framework3 and transformer-based architecture4 aggregated these into a global video representation. A linear layer classifier with two output neurons generated the final video-level BI score. A total of 105 videos were used for training and validation of the framework, while 51 were reserved for testing. The model’s ability to predict overall, epithelial, and vascular BI was evaluated. Diagnostic performance was expressed as sensitivity, specificity, accuracy, positive predictive value, negative predictive value, and F1 score.
Results
Overall, 48 IBD patients (28 CD and 20 UC) were considered. 119/156 (76%) videos showed BI, 68/101 (67%) in Ireland and 51/55 (93%) in the UK cohort. Table 1 details the diagnostic performance of the model. The AI algorithm achieved 77% sensitivity, 75% specificity, and 77% accuracy in detecting overall BI at the video level. Moreover, the model assessed the presence of epithelial and vascular BI with 70% and 85% sensitivity, 72% and 77% specificity, 71% and 82% accuracy. Diagnostic performance was higher in UC than CD.
|
|
Overall barrier impairment |
Epithelial barrier impairment |
Vascular barrier impairment |
||
|
|
IBD |
CD |
UC |
IBD |
IBD |
|
Accuracy (95% CI) |
76.5 [64.7-88.2] |
71.3 [53.6-85.7] |
85.0 [70.0-100] |
70.6 [56.9-82.4] |
82.4 [70.6-92.2] |
|
Sensitivity (95% CI) |
76.9 [63.2-89.5] |
71.3 [50.0-90.0] |
86.7 [66.7-100] |
69.7 [53.1-84.6] |
85.3 [72.4-96.8] |
|
Specificity (95% CI) |
75.0 [50.0-100] |
71.4 [33.3-100] |
79.9 [33.3-100] |
72.2 [50.0-92.9] |
76.5 [53.8-94.7] |
|
PPV (95% CI) |
90.9 [80.0-100] |
88.3 [70.6-100] |
92.8 [76.9-100] |
82.1 [66.7-95.8] |
87.9 [75.8-97.2] |
|
NPV (95% CI) |
50 [26.7-73.3] |
45.2 [15.4-75.0] |
66.7 [25.0-100] |
56.5 [35.7-76.6] |
72.2 [50.0-92.9] |
|
F1 score (95% CI) |
83.3 [73.0-91.7] |
78.4 [62.5-91.3] |
89.3 [75.0-100] |
75.4 [61.8-86.2] |
86.6 [76.7-94.4] |
|
AUC (95% CI) |
81.5 [66.2-93.9] |
82.9 [62.6-97.9] |
83.9 [55.6-100.0] |
83.7 [71.6-93-0] |
88.0 [76.5-96.6] |
Conclusions
Our AI model enables automated and standardised assessment of intestinal impairment in IBD, offering an objective and reproducible tool to monitor barrier healing as a novel therapeutic endpoint.