Skip to content

Latest commit

 

History

History
54 lines (38 loc) · 6.81 KB

applications.md

File metadata and controls

54 lines (38 loc) · 6.81 KB

Applications

Named Entity Linking

Named entity linking (NEL) is the task of linking ambiguous mentions in text to entities in a knowledge base. NEL is a core preprocessing step in downstream applications, including search and question answering.

Medical Imaging

  • Sensitive to inputs, not models

    • The variation of imaging configurations (e.g. site locations), hardware, and processing techniques (e.g. CT windowing) lead to large performance shifts
    • Recent medical imaging challenges (segmentation: knee, brain, reconstruction: MRI), found that, to a large extent, the choice of model is less important than the underlying distribution of data (e.g. disease extent)
  • Towards multi-modal data fusion

    • Radiologist reports (and more generally text) have been used to improve learned visual representations (e.g. ConVIRT) and to source weak labels in annotation-scarce settings (e.g. (PET/CT))
    • Auxiliary features from other rich, semi-structured data, such as electronic health records (EHRs), successfully complemented standard image representations
  • Data Models for Dataset Drift Controls in Machine Learning With Images

    • Drift synthesis enables the controlled generation of physically faithful drift test cases. The experiments presented here show that the average decrease in model performance is ten to four times less severe than under post-hoc augmentation testing.
    • The gradient connection between task and data models allows for drift forensics that can be used to specify performance-sensitive data models which should be avoided during deployment of a machine learning model.
    • Drift adjustment opens up the possibility for processing adjustments in the face of drift. This can lead to speed up and stabilization of classifier training at a margin of up to 20% in validation accuracy.

Computational Biology

Remote Sensing

- [Data Models](https://arxiv.org/abs/2211.02578) - Tolerancing allows for for prospective validation of machine learning task model under physically faithful dataset drifts - Differentiable data models allow the optimization of the data generating process