trigaten · trigaten · Mar 4, 2023
diff --git a/docs/assets/multimodal_cot.png b/docs/assets/multimodal_cot.png
diff --git a/docs/miscl/mutli_cot.md b/docs/miscl/mutli_cot.md
@@ -0,0 +1,13 @@
+---
+sidebar_position: 10
+---
+
+# 🟡 Multimodal CoT
+
+You've seen chain of thought used with text, but can it be used to answer questions about images? The answer is yes! This is called multimodal chain of thought (i.e. there are 2 modalities involved, text *and* image). The process is similar to the text version, but there are some differences.
+
+import multimodal_cot from '../assets/multimodal_cot.png';
+
+<div style={{textAlign: 'center'}}>
+  <img src={multimodal_cot} style={{width: "500px"}} />
+</div>