Hi Ishant Singh,
MedImageInsight is an embedding model designed to generate vector representations from medical images, which can be used for downstream tasks such as similarity search, classification, or report generation. The embeddings themselves cannot be directly decoded into structured labels or full reports. To generate draft reports, we recommend either fine-tuning a text decoder (e.g., a transformer) on top of the embeddings using image–report pairs or implementing a similarity search approach, where embeddings from new images are matched against a labeled embedding library to retrieve the closest reports for radiologist review.
Currently, no public labeled embedding dataset is provided out of the box. However, you can create one by generating embeddings from your historical image–report archives or from publicly available datasets (e.g., MIMIC-CXR) and associating them with their corresponding labels or reports.
Regarding DICOM support, MedImageInsight does not natively accept DICOM files. The images must be converted into 2D image formats such as PNG or JPEG for embedding generation. This conversion enables processing of individual slices or projections, though some volumetric and 3D spatial details may be lost. Azure supports this workflow through healthcare AI pipelines that include DICOM ingestion, conversion, and embedding generation, and Azure Health Data Services tools such as CloudSync and DICOMcast can be used to manage, synchronize, and query DICOM metadata and images in the cloud.
If your goal is to generate preliminary reports more directly, you may also consider alternative AI models such as LLaVA‑Rad, a visual–language model trained on radiology data that can produce draft reports from medical images. This can be used alongside MedImageInsight for embedding-based similarity search to accelerate radiologist workflows while maintaining human oversight.
For production scenarios, we recommend using the Azure orchestrated multimodal AI insights pipeline for MedImageInsight, which streamlines DICOM handling, embedding generation, and integration with downstream AI workflows.
For your reference: Use orchestrate multimodal AI insights (preview) in healthcare data solutions
Cloud migration for medical imaging data using Azure Health Data Services and IMS
Discovering the Power of Finetuning MedImageInsight on Your Data
How to use MedImageInsight healthcare AI model for medical image embedding generation
Healthcare AI Models