Masked latent transformer with random masking ratio to advance the diagnosis of dental fluorosis

Jun 5, 2025ยท
Hao Xu
Hao Xu
,
Yun Wu
,
Junpeng Wu
,
Rui Xie
,
Maohua Gu
,
Rongpin Wang
ยท 1 min read
MLTrMR Model Structure Diagram
Abstract
Dental fluorosis is a chronic condition caused by long-term overconsumption of fluoride, which leads to changes in the appearance of tooth enamel. Diagnosing its severity can be challenging for dental professionals, and limited research on deep learning applications in this field. Therefore, we propose a novel deep learning model, masked latent transformer with random masking ratio (MLTrMR), to advance the diagnosis of dental fluorosis. MLTrMR enhances contextual learning by using a masked latent modeling scheme based on Vision Transformer. It extracts latent tokens from the original image with a latent embedder, processes unmasked tokens with a latent transformer (LT) block, and predicts masked tokens. To improve model performance, we incorporate an auxiliary loss function. MLTrMR achieves state-of-the-art results, with 80.19% accuracy, 75.79% F1 score, and 81.28% quadratic weighted kappa on the first open-source dental fluorosis image dataset (DFID) we constructed.
Type
Publication
Journal of Visual Communication and Image Representation, 104496
Note

Click the Cite button above to cite this paper.

@article{xu2025masked,
  title={Masked latent transformer with random masking ratio to advance the diagnosis of dental fluorosis},
  author={Xu, Hao and Wu, Yun and Wu, Junpeng and Xie, Rui and Gu, Maohua and Wang, Rongpin},
  journal={Journal of Visual Communication and Image Representation},
  pages={104496},
  year={2025},
  publisher={Elsevier}
}