Analyzes medical images (X-ray, MRI, CT) using multimodal LLMs to identify anomalies and generate reports.
The Multimodal Medical Imaging Analysis Skill uses Vision-Language Models (VLMs) such as Gemini 1.5 Pro and GPT-4o to interpret medical images alongside clinical text.
User: "Analyze this chest X-ray for pneumonia."
Agent Action:
python3 Skills/Clinical/Medical_Imaging/Multimodal_Analysis/multimodal_agent.py \
--image "/path/to/cxr.jpg" \
--prompt "Check for signs of pneumonia and consolidation."
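
The same invocation can be driven from Python when the skill is called programmatically. The following is a minimal sketch, not the skill's own code: it assumes only the script path and the `--image` and `--prompt` flags shown above. The helper names `build_command` and `run_analysis` are hypothetical, and the assumption that the report is written to standard output is not confirmed by the source.

```python
import shlex
import subprocess

# Script path copied from the example invocation above.
SCRIPT = "Skills/Clinical/Medical_Imaging/Multimodal_Analysis/multimodal_agent.py"


def build_command(image_path: str, prompt: str) -> list[str]:
    """Build the argv list for multimodal_agent.py (hypothetical helper)."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    # Passing arguments as a list avoids shell quoting problems in the prompt.
    return ["python3", SCRIPT, "--image", image_path, "--prompt", prompt]


def run_analysis(image_path: str, prompt: str) -> str:
    """Run the script and return its stdout.

    Assumption: the script prints its report to stdout; adjust if it
    writes to a file instead.
    """
    result = subprocess.run(
        build_command(image_path, prompt),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


if __name__ == "__main__":
    cmd = build_command(
        "/path/to/cxr.jpg",
        "Check for signs of pneumonia and consolidation.",
    )
    # Show the equivalent shell command without executing it.
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps prompts containing quotes or spaces intact, since no shell parsing is involved.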