Mod readme of Neural Coder (#1405)

kaikaiyao · chensuyue · commit 7164e32efa8b · 2022-11-01T00:18:11.000+08:00
* Update README.md * Update README.md * Update README.md * Update README.md * Create PythonAPI.md * Update PythonAPI.md * Update README.md * Update PythonAPI.md * Create SupportMatrix.md * Create SupportMatrix.md * Update SupportMatrix.md * Update SupportMatrix.md * Update README.md * Update README.md * Update SupportMatrix.md * Update SupportMatrix.md * Update SupportMatrix.md * Update SupportMatrix.md * Update SupportMatrix.md (cherry picked from commit 1690c5c)
diff --git a/README.md b/README.md
@@ -64,7 +64,7 @@ dataset = quantizer.dataset('dummy', shape=(1, 224, 224, 3))
 quantizer.calib_dataloader = common.DataLoader(dataset)
 quantizer.fit()
 ```
-### Quantization with [JupyterLab Extension](./neural_coder/extensions/neural_compressor_ext_lab/README.md) (Experimental)
+### Quantization with [JupyterLab Extension](./neural_coder/extensions/neural_compressor_ext_lab/README.md)
 Search for ```jupyter-lab-neural-compressor``` in the Extension Manager in JupyterLab and install with one click:
 
 <a target="_blank" href="./neural_coder/extensions/screenshots/extmanager.png">
@@ -84,20 +84,6 @@ inc_bench
   <img src="./docs/imgs/INC_GUI.gif" alt="Architecture">
 </a>
 
-### Quantization with [Neural Coder](./neural_coder/docs/Quantization.md) (Experimental)
-
-```python
-from neural_coder import auto_quant
-auto_quant(
-    code="https://github.com/huggingface/transformers/blob/v4.21-release/examples/pytorch/text-classification/run_glue.py",
-    args="--model_name_or_path albert-base-v2 \
-          --task_name sst2 \
-          --do_eval \
-          --output_dir result \
-          --overwrite_output_dir",
-)
-```
-
 ## System Requirements
 
 ### Validated Hardware Environment
@@ -252,7 +238,7 @@ Intel® Neural Compressor validated 420+ [examples](./examples) for quantization
     </tr>
     <tr>
         <td colspan="4" align="center"><a href="docs/distillation_quantization.md">Distillation for Quantization</a></td>
-        <td colspan="5" align="center"><a href="neural_coder">Neural Coder (No-Code Solution)</a></td>
+        <td colspan="5" align="center"><a href="neural_coder">Neural Coder</a></td>
     </tr>    
     
   </tbody>
diff --git a/neural_coder/README.md b/neural_coder/README.md
@@ -35,62 +35,13 @@ simultaneously on below PyTorch evaluation code, we generate the optimized code
 
 ## Getting Started!
 
-### Neural Coder for Quantization
-We provide a feature that helps automatically enable quantization on Deep Learning models and automatically evaluates for the best performance on the model. It is a code-free solution that can help users enable quantization algorithms on a model with no manual coding needed. Supported features include Post-Training Static Quantization, Post-Training Dynamic Quantization, and Mixed Precision. For more details please refer to this [guide](docs/AutoQuant.md).
+There are currently 2 ways to use Neural Coder for automatic quantization enabling and benchmark.
 
-### General Guide
-We currently provide 3 main user-facing APIs for Neural Coder: enable, bench and superbench.
-#### Enable
-Users can use ```enable()``` to enable specific features into DL scripts:
-```
-from neural_coder import enable
-enable(
-    code="neural_coder/examples/vision/resnet50.py",
-    features=[
-        "pytorch_jit_script",
-        "pytorch_channels_last",
-    ],
-)
-```
-To run benchmark directly on the optimization together with the enabling:
-```
-from neural_coder import enable
-enable(
-    code="neural_coder/examples/vision/resnet50.py",
-    features=[
-        "pytorch_jit_script",
-        "pytorch_channels_last"
-    ],
-    run_bench=True,
-)
-```
-#### Bench
-To run benchmark on your code with an existing patch:
-```
-from neural_coder import bench
-bench(
-    code="neural_coder/examples/vision/resnet50.py",
-    patch_path="${your_patch_path}",
-)
-```
-#### SuperBench
-To sweep on optimization sets with a fixed benchmark configuration:
-```
-from neural_coder import superbench
-superbench(code="neural_coder/examples/vision/resnet50.py")
-```
-To sweep on benchmark configurations for a fixed optimization set:
-```
-from neural_coder import superbench
-superbench(
-    code="neural_coder/examples/vision/resnet50.py",
-    sweep_objective="bench_config",
-    bench_feature=[
-        "pytorch_jit_script",
-        "pytorch_channels_last",
-    ],
-)
-```
+### Jupyter Lab Extension
+We offer Neural Coder as an extension plugin in Jupyter Lab. This enables users to utilize Neural Coder while writing their Deep Learning models in Jupyter Lab coding platform. Users can simply search for ```jupyter-lab-neural-compressor``` in the Extension Manager in JupyterLab and install Neural Coder with one click. For more details, please refer to this [guide](extensions/neural_compressor_ext_lab/README.md)
+
+### Python API
+There are 3 user-facing APIs for Neural Coder: enable, bench and superbench. For more details, please refer to this [guide](docs/PythonAPI.md). We have provided a [list](docs/SupportMatrix.md) of supported Deep Learning optimization features. Specifically for quantization, we provide an auto-quantization API that helps automatically enable quantization on Deep Learning models and automatically evaluates for the best performance on the model with no manual coding needed. Supported features include Post-Training Static Quantization, Post-Training Dynamic Quantization, and Mixed Precision. For more details, please refer to this [guide](docs/Quantization.md).
 
 ## Contact
 Please contact us at [inc.maintainers@intel.com](mailto:inc.maintainers@intel.com) for any Neural Coder related question.
diff --git a/neural_coder/docs/PythonAPI.md b/neural_coder/docs/PythonAPI.md
@@ -0,0 +1,58 @@
+Neural Coder as Python API
+===========================
+
+We currently provide 3 main user-facing APIs for Neural Coder: enable, bench and superbench.
+
+#### Enable
+Users can use ```enable()``` to enable specific features into DL scripts:
+```
+from neural_coder import enable
+enable(
+    code="neural_coder/examples/vision/resnet50.py",
+    features=[
+        "pytorch_jit_script",
+        "pytorch_channels_last",
+    ],
+)
+```
+To run benchmark directly on the optimization together with the enabling:
+```
+from neural_coder import enable
+enable(
+    code="neural_coder/examples/vision/resnet50.py",
+    features=[
+        "pytorch_jit_script",
+        "pytorch_channels_last"
+    ],
+    run_bench=True,
+)
+```
+
+#### Bench
+To run benchmark on your code with an existing patch:
+```
+from neural_coder import bench
+bench(
+    code="neural_coder/examples/vision/resnet50.py",
+    patch_path="${your_patch_path}",
+)
+```
+
+#### SuperBench
+To sweep on optimization sets with a fixed benchmark configuration:
+```
+from neural_coder import superbench
+superbench(code="neural_coder/examples/vision/resnet50.py")
+```
+To sweep on benchmark configurations for a fixed optimization set:
+```
+from neural_coder import superbench
+superbench(
+    code="neural_coder/examples/vision/resnet50.py",
+    sweep_objective="bench_config",
+    bench_feature=[
+        "pytorch_jit_script",
+        "pytorch_channels_last",
+    ],
+)
+```
diff --git a/neural_coder/docs/SupportMatrix.md b/neural_coder/docs/SupportMatrix.md
@@ -0,0 +1,18 @@
+Supported Optimization Features
+===========================
+
+| Framework | Optimization | API Alias |
+| ------------- | ------------- | ------------- |
+| PyTorch | [Mixed Precision](https://pytorch.org/docs/stable/amp.html) | `pytorch_amp` |
+| PyTorch | [Channels Last](https://pytorch.org/tutorials/intermediate/memory_format_tutorial.html) | `pytorch_channels_last` |
+| PyTorch | [JIT (Just-In-Time) Script/Trace](https://pytorch.org/docs/stable/jit.html) & [optimize_for_inference](https://pytorch.org/docs/stable/generated/torch.jit.optimize_for_inference.html) | `pytorch_jit_script`, `pytorch_jit_trace`, `pytorch_jit_script_ofi`, `pytorch_jit_trace_ofi` |
+| PyTorch | JIT with [TorchDynamo](https://github.com/pytorch/torchdynamo) | `pytorch_torchdynamo_jit_script`, `pytorch_torchdynamo_jit_trace`, `pytorch_torchdynamo_jit_script_ofi`, `pytorch_torchdynamo_jit_trace_ofi` |
+| PyTorch | [Intel Neural Compressor Mixed Precision](https://github.com/intel/neural-compressor/blob/master/docs/mixed_precision.md) | `pytorch_inc_bf16` | 
+| PyTorch | [Intel Neural Compressor INT8 Static Quantization (FX/IPEX)](https://github.com/intel/neural-compressor/blob/master/docs/PTQ.md) | `pytorch_inc_static_quant_fx`, `pytorch_inc_static_quant_ipex` |
+| PyTorch | [Intel Neural Compressor INT8 Dynamic Quantization](https://github.com/intel/neural-compressor/blob/master/docs/dynamic_quantization.md) | `pytorch_inc_dynamic_quant` |
+| PyTorch | [Intel Extension for PyTorch (FP32, BF16, INT8 Static/Dynamic Quantization)](https://github.com/intel/intel-extension-for-pytorch) | `pytorch_ipex_fp32`, `pytorch_ipex_bf16`, `pytorch_ipex_int8_static_quant`, `pytorch_ipex_int8_dynamic_quant` |
+| PyTorch | [Alibaba Blade-DISC](https://github.com/alibaba/BladeDISC) | `pytorch_aliblade` |
+| PyTorch Lightning | [Mixed Precision](https://pytorch-lightning.readthedocs.io/en/latest/guides/speed.html) | `pytorch_lightning_bf16_cpu` |
+| TensorFlow | [Mixed Precision](https://www.intel.com/content/www/us/en/developer/articles/guide/getting-started-with-automixedprecisionmkl.html) | `tensorflow_amp` |
+| Keras | [Mixed Precision](https://www.tensorflow.org/guide/mixed_precision) | `keras_amp` |
+| ONNX Runtime | [INC Static Quantization (QLinear)](https://github.com/intel/neural-compressor/blob/master/examples/onnxrt/README.md#operator-oriented-with-qlinearops) | `onnx_inc_static_quant_qlinear` |