Example questions to review paper
what do they do with the input data?
what model do they use (what's the underlying self-supervised learning task)
what do they obtain (it seems they obtain a BERT model often, which is encoder only so you get a bunch of embeddings)
what do they test this stuff on?
Shaky fundations
The review examines 84 foundation models trained on non-imaging EHR data, creating a taxonomy of their architectures, training data, and potential use cases.
Most models are trained on small clinical datasets like MIMIC-III or broad biomedical corpora like PubMed and are evaluated on tasks that do not necessarily reflect their utility in health systems.