MedImageInsight Embedding Usage + Report Generation Workflow (with Accuracy Clarification)

Ishant Singh 0 Reputation points
2025-08-06T05:36:36.7533333+00:00

Hello Team,

We’re currently evaluating MedImageInsight as part of a medical imaging workflow aimed at automated report generation to assist radiologists. We understand that the model generates embeddings from medical images, and we’re exploring the best way to utilize them effectively.

Below are our questions and goals:


🔹 Primary Objective:

To generate preliminary medical reports from medical images (or DICOMs) using MedImageInsight or any other AI model that can support this task.


🔹 Context:

  • We are not aiming for 100% diagnostic accuracy.
  • Our goal is to reduce reporting time and assist certified radiologists by providing a draft report that they will always review and finalize.
  • For our use case, 60–80% accuracy is acceptable, as it serves to accelerate workflow, not replace medical judgment.

🔹 Questions & Clarifications:

  1. Embedding Decoding
    • We understand MedImageInsight generates embeddings as output.
    • Some sources suggest these embeddings could be passed into another AI model to decode them into structured labels or full reports.
    • However, newer insights suggest decoding embeddings directly may not be possible.
    • Is there any model or recommended method to decode or interpret MedImageInsight embeddings into report-like outputs or labels?
  2. Label Dataset for Embedding Matching
    • If there’s a reference dataset of labeled embeddings (image-label or image-report pairs used during MedImageInsight’s training), we would like to access or replicate it.
    • We plan to perform similarity search between new image embeddings and this labeled set, and generate preliminary reports based on the closest matches.
  3. DICOM Support
    • Is it possible to provide DICOM files directly to MedImageInsight?
    • Converting DICOMs to PNG/JPEG seems to reduce volumetric (3D) detail, which may impact model effectiveness.
    • This is not a blocking requirement — we’re willing to proceed with 2D inputs, but would appreciate any guidance.
  4. Alternative Model Recommendation
    • If there’s any alternative AI model that directly generates radiology reports from medical images or DICOMs with 60–80% reliability, we are open to adopting that instead of or alongside MedImageInsight.

We’re building a clinical decision support tool that works alongside certified radiologists, not as a replacement. Any support, guidance, model links, or reference materials would be highly appreciated.

Thank you in advance for your assistance.

Best regards, Ishant Singh Founder – WhatsDiscuss | SmartAcademy | SmartCarePlus

Azure Machine Learning
{count} votes

1 answer

Sort by: Most helpful
  1. Pavankumar Purilla 10,425 Reputation points Microsoft External Staff Moderator
    2025-08-06T09:45:16.8766667+00:00

    Hi Ishant Singh,

    MedImageInsight is an embedding model designed to generate vector representations from medical images, which can be used for downstream tasks such as similarity search, classification, or report generation. The embeddings themselves cannot be directly decoded into structured labels or full reports. To generate draft reports, we recommend either fine-tuning a text decoder (e.g., a transformer) on top of the embeddings using image–report pairs or implementing a similarity search approach, where embeddings from new images are matched against a labeled embedding library to retrieve the closest reports for radiologist review.

    Currently, no public labeled embedding dataset is provided out of the box. However, you can create one by generating embeddings from your historical image–report archives or from publicly available datasets (e.g., MIMIC-CXR) and associating them with their corresponding labels or reports.

    Regarding DICOM support, MedImageInsight does not natively accept DICOM files. The images must be converted into 2D image formats such as PNG or JPEG for embedding generation. This conversion enables processing of individual slices or projections, though some volumetric and 3D spatial details may be lost. Azure supports this workflow through healthcare AI pipelines that include DICOM ingestion, conversion, and embedding generation, and Azure Health Data Services tools such as CloudSync and DICOMcast can be used to manage, synchronize, and query DICOM metadata and images in the cloud.

    If your goal is to generate preliminary reports more directly, you may also consider alternative AI models such as LLaVA‑Rad, a visual–language model trained on radiology data that can produce draft reports from medical images. This can be used alongside MedImageInsight for embedding-based similarity search to accelerate radiologist workflows while maintaining human oversight.

    For production scenarios, we recommend using the Azure orchestrated multimodal AI insights pipeline for MedImageInsight, which streamlines DICOM handling, embedding generation, and integration with downstream AI workflows.

    For your reference: Use orchestrate multimodal AI insights (preview) in healthcare data solutions
    Cloud migration for medical imaging data using Azure Health Data Services and IMS
    Discovering the Power of Finetuning MedImageInsight on Your Data

    How to use MedImageInsight healthcare AI model for medical image embedding generation
    Healthcare AI Models

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.