From 5c5e7daa573294857d7e4097acab0b23298b1ecb Mon Sep 17 00:00:00 2001
From: "Wang, Yi A" <yi.a.wang@intel.com>
Date: Sat, 17 Feb 2024 22:32:03 -0800
Subject: [PATCH] Update README and docs index: add TRL PPO pipeline to validated models

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
---
 README.md             |  7 ++++---
 docs/source/index.mdx | 13 +++++++++++--
 2 files changed, 15 insertions(+), 5 deletions(-)
diff --git a/README.md b/README.md
index 1dd050ee9c..09a80082d6 100644
--- a/README.md
+++ b/README.md
@@ -197,7 +197,7 @@ The following model architectures, tasks and device distributions have been vali
 <div align="center">
 
 | Architecture     | Training | Inference            | Tasks |
-|------------------|:--------:|:--------------------:|:-----:|
+|------------------|:--------:|:--------------------:|:------|
 | Stable Diffusion |          | <li>Single card</li> | <li>[text-to-image generation](https://github.com/huggingface/optimum-habana/tree/main/examples/stable-diffusion)</li> |
 | LDM3D            |          | <li>Single card</li> | <li>[text-to-image generation](https://github.com/huggingface/optimum-habana/tree/main/examples/stable-diffusion)</li> |
 
@@ -208,8 +208,9 @@ The following model architectures, tasks and device distributions have been vali
 <div align="center">
 
 | Architecture     | Training | Inference            | Tasks |
-|------------------|:--------:|:--------------------:|:-----:|
-| Llama 2          | <li>Multi card</li> |           | <li>[DPO Pipeline](https://github.com/huggingface/optimum-habana/tree/main/examples/trl)</li> |
+|------------------|:--------:|:--------------------:|:------|
+| Llama 2          | :heavy_check_mark: |           | <li>[DPO Pipeline](https://github.com/huggingface/optimum-habana/tree/main/examples/trl)</li> |
+| Llama 2          | :heavy_check_mark: |           | <li>[PPO Pipeline](https://github.com/huggingface/optimum-habana/tree/main/examples/trl)</li> |
 
 </div>
 
diff --git a/docs/source/index.mdx b/docs/source/index.mdx
index fd394af6ff..ff85c83048 100644
--- a/docs/source/index.mdx
+++ b/docs/source/index.mdx
@@ -64,11 +64,20 @@ In the tables below, ✅ means single-card, multi-card and DeepSpeed have all be
 
 - Diffusers
 
-| Architecture     | Training | Inference | <center>Tasks</center> |
-|------------------|:--------:|:---------:|------------------------|
+| Architecture     | Training | Inference | Tasks |
+|------------------|:--------:|:---------:|:------|
 | Stable Diffusion |          | <div style="text-align:left"><li>Single card</li></div> | <li>[text-to-image generation](https://github.com/huggingface/optimum-habana/tree/main/examples/stable-diffusion)</li> |
 | LDM3D            |          | <div style="text-align:left"><li>Single card</li></div> | <li>[text-to-image generation](https://github.com/huggingface/optimum-habana/tree/main/examples/stable-diffusion)</li> |
 
+
+- TRL
+
+| Architecture     | Training | Inference            | Tasks |
+|------------------|:--------:|:--------------------:|:------|
+| Llama 2          | ✅       |           | <li>[DPO Pipeline](https://github.com/huggingface/optimum-habana/tree/main/examples/trl)</li> |
+| Llama 2          | ✅       |           | <li>[PPO Pipeline](https://github.com/huggingface/optimum-habana/tree/main/examples/trl)</li> |
+
+
 Other models and tasks supported by the 🤗 Transformers and 🤗 Diffusers library may also work.
 You can refer to this [section](https://github.com/huggingface/optimum-habana#how-to-use-it) for using them with 🤗 Optimum Habana.
 Besides, [this page](https://github.com/huggingface/optimum-habana/tree/main/examples) explains how to modify any [example](https://github.com/huggingface/transformers/tree/main/examples/pytorch) from the 🤗 Transformers library to make it work with 🤗 Optimum Habana.