# {BACKEND_NAME} Quantization

Document quantization schemes and flows for the backend. This should include a description of each scheme and a code example to perform quantization. Example sections for PT2E and `quantize_` are included below, to be replaced with details for the target backend.

### Supported Quantization Schemes

The {BACKEND_NAME} delegate supports the following quantization schemes:

- {QUANTIZATION_SCHEME_1}
- {QUANTIZATION_SCHEME_2}

### {QUANTIZATION_METHOD_1} using the PT2E Flow

To perform {QUANTIZATION_METHOD_1} with the PT2E flow, perform the following steps prior to exporting the model:

1) Create an instance of the `{BackendName}Quantizer` class and set the desired quantization parameters.
2) Use `torch.export.export` to capture the model graph.
3) Call `prepare_pt2e` to prepare the captured model for quantization.
4) For static quantization, run the prepared model with representative samples to calibrate the quantized tensor activation ranges.
5) Call `convert_pt2e` to quantize the model.
6) Export and lower the model using the standard flow.

The output of `convert_pt2e` is a PyTorch model, which can be exported and lowered using the normal flow. Because it is a regular PyTorch model, it can also be used to evaluate the accuracy of the quantized model with standard PyTorch techniques, as sketched after the example below.

```python
import torch
import {MODEL_IMPORT_PATH} as models
from {MODEL_WEIGHTS_IMPORT}
from executorch.backends.{backend_name}.quantizer.{backend_name}_quantizer import {BackendName}Quantizer, {get_quantization_config_function}
from executorch.backends.{backend_name}.partition.{backend_name}_partitioner import {BackendName}Partitioner
from executorch.exir import to_edge_transform_and_lower
from torchao.quantization.pt2e.quantize_pt2e import convert_pt2e, prepare_pt2e

model = models.{model_name}.{model_function}(weights={ModelWeights}.DEFAULT).eval()
sample_inputs = ({SAMPLE_INPUT_SHAPE},)

qparams = {get_quantization_config_function}({QUANTIZATION_PARAMS})  # (1)
quantizer = {BackendName}Quantizer()
quantizer.set_global(qparams)

training_ep = torch.export.export(model, sample_inputs).module()  # (2)
prepared_model = prepare_pt2e(training_ep, quantizer)  # (3)

for cal_sample in [{CALIBRATION_SAMPLE}]:  # Replace with representative model inputs
    prepared_model(cal_sample)  # (4) Calibrate

quantized_model = convert_pt2e(prepared_model)  # (5)

et_program = to_edge_transform_and_lower(  # (6)
    torch.export.export(quantized_model, sample_inputs),
    partitioner=[{BackendName}Partitioner()],
).to_executorch()
```
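
As a quick sanity check, the converted model's outputs can be compared against the float model's. The following is a minimal sketch, assuming `sample_inputs` holds a single input tensor and the model returns a single tensor; a real evaluation would use a representative dataset and a task-appropriate metric.

```python
# Compare float and quantized outputs on the sample input
# (a quick sanity check, not a full accuracy evaluation).
with torch.no_grad():
    float_out = model(*sample_inputs)
    quant_out = quantized_model(*sample_inputs)

# Mean absolute difference introduced by quantization.
print("Mean absolute error:", (float_out - quant_out).abs().mean().item())
```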

See [PyTorch 2 Export Post Training Quantization](https://docs.pytorch.org/ao/main/tutorials_source/pt2e_quant_ptq.html) for more information.

### LLM Quantization with `quantize_`

The {BACKEND_NAME} backend also supports quantizing models with the [torchao](https://github.com/pytorch/ao) `quantize_` API. {ADVANCED_QUANTIZATION_DESCRIPTION}

Below is a simple example; a more detailed tutorial, including accuracy evaluation on popular benchmarks, can be found in the [torchao documentation]({TORCHAO_DOCS_URL}).

```python
import torch
from torchao.quantization.granularity import PerAxis, PerGroup
from torchao.quantization.quant_api import (
    Int8DynamicActivationIntxWeightConfig,
    IntxWeightOnlyConfig,
    quantize_,
)

# eager_model is the (unexported) torch.nn.Module to be quantized.

# Quantize embeddings with 8-bit weights, per channel
embedding_config = IntxWeightOnlyConfig(
    weight_dtype=torch.int8,
    granularity=PerAxis(0),
)
quantize_(
    eager_model,
    embedding_config,
    lambda m, fqn: isinstance(m, torch.nn.Embedding),
)

# Quantize linear layers with 8-bit dynamic activations and 4-bit weights
linear_config = Int8DynamicActivationIntxWeightConfig(
    weight_dtype=torch.int4,
    weight_granularity=PerGroup(32),
)
quantize_(eager_model, linear_config)
```
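
`quantize_` modifies the eager model in place, so the model can then be exported and lowered with the same flow as in the PT2E example. The following is a minimal sketch, reusing the `{BackendName}Partitioner` placeholder from above and assuming `sample_inputs` is a tuple of example inputs matching the model's forward signature:

```python
import torch

from executorch.backends.{backend_name}.partition.{backend_name}_partitioner import {BackendName}Partitioner
from executorch.exir import to_edge_transform_and_lower

# Export and lower the quantized eager model using the standard flow.
et_program = to_edge_transform_and_lower(
    torch.export.export(eager_model, sample_inputs),
    partitioner=[{BackendName}Partitioner()],
).to_executorch()
```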