28 changes: 26 additions & 2 deletions docs/how-to-use-and-FAQ/quantized-int8-inference.md
@@ -20,7 +20,7 @@ Some imagenet sample images here https://github.com/nihui/imagenet-sample-images

```shell
find images/ -type f > imagelist.txt
-./ncnn2table mobilenet-opt.param mobilenet-opt.bin imagelist.txt mobilenet.table mean=[104,117,123] norm=[0.017,0.017,0.017] shape=[224,224,3] pixel=BGR thread=8 method=kl
+./ncnn2table mobilenet-opt.param mobilenet-opt.bin imagelist.txt mobilenet.table mean=[104,117,123] norm=[0.017,0.017,0.017] shape=[224,224,3] pixel=BGR thread=8 method=kl format=txt
```

* mean and norm are the values you passed to ```Mat::substract_mean_normalize()``` (see the preprocessing sketch after this list)
@@ -35,6 +35,7 @@ find images/ -type f > imagelist.txt
* pixel is the pixel format of your model; image pixels will be converted to this type before ```Extractor::input()```
* thread is the CPU thread count that can be used for parallel inference
* method is the post-training quantization algorithm; kl and aciq are currently supported
* format is the file format of the emitted quantization table, either `txt` or `ini`; `txt` is the default
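
For reference, here is a minimal sketch of the matching preprocessing on the inference side, using ncnn's C++ API. The function name `preprocess_bgr` and its arguments are hypothetical placeholders for your own image buffer and its dimensions; the mean/norm values are the ones passed to ncnn2table above and must stay identical at inference time.

```cpp
#include "mat.h" // ncnn

// Minimal sketch: inference-time preprocessing must mirror the
// mean/norm/shape/pixel settings given to ncnn2table.
ncnn::Mat preprocess_bgr(const unsigned char* bgr_data, int w, int h)
{
    // resize to the calibration shape 224x224, keeping BGR pixel order
    ncnn::Mat in = ncnn::Mat::from_pixels_resize(bgr_data, ncnn::Mat::PIXEL_BGR, w, h, 224, 224);

    // same mean/norm as the ncnn2table command above
    const float mean_vals[3] = {104.f, 117.f, 123.f};
    const float norm_vals[3] = {0.017f, 0.017f, 0.017f};
    in.substract_mean_normalize(mean_vals, norm_vals);
    return in;
}
```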

If your model has multiple input nodes, you can use multiple list files and per-input parameter values, as sketched below
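
A plausible two-input invocation, assuming the comma-separated per-input convention, might look like the following; the depth image list and its mean/norm/shape values are illustrative only:

```shell
find images-bgr/ -type f > imagelist-bgr.txt
find images-depth/ -type f > imagelist-depth.txt
./ncnn2table mobilenet-opt.param mobilenet-opt.bin imagelist-bgr.txt,imagelist-depth.txt mobilenet.table mean=[104,117,123],[128] norm=[0.017,0.017,0.017],[0.0078125] shape=[224,224,3],[224,224,1] pixel=BGR,GRAY thread=8 method=kl format=txt
```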

@@ -60,7 +61,7 @@ mobilenet.load_model("mobilenet-int8.bin");

## mixed precision inference

-Before quantize your model, comment the layer weight scale line in table file, then the layer will do the float32 inference
+Before quantizing your model, comment out the layer's weight scale line in the `txt`-format table file; that layer will then run float32 inference

```
conv1_param_0 156.639840536
```

@@ -69,3 +70,26 @@
```
#conv1_param_0 156.639840536
```

If you are using the `ini` format, remove the layer's entire quantization parameter section instead, for example:

```
[conv0]
type = "Conv"
weight = [ 156.639840536 ]
input_scale = 1.23

[fire]
type = "Gemm"
weight = [ 156.639840536 ]
input_scale = 1.23
```

to

```
[fire]
type = "Gemm"
weight = [ 156.639840536 ]
input_scale = 1.23
```
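
Either way, after editing the table, regenerate the quantized model with ncnn2int8 as usual; layers whose scales were commented out or removed stay in float32. The file names below follow the mobilenet example used earlier on this page:

```shell
./ncnn2int8 mobilenet-opt.param mobilenet-opt.bin mobilenet-int8.param mobilenet-int8.bin mobilenet.table
```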
2 changes: 1 addition & 1 deletion tools/CMakeLists.txt
@@ -26,7 +26,7 @@ if(NCNN_VULKAN)
target_link_libraries(ncnn2mem PRIVATE ${Vulkan_LIBRARY})
endif()

-add_executable(ncnnoptimize ncnnoptimize.cpp)
+add_executable(ncnnoptimize ncnnoptimize.cpp modelwriter.cpp)
target_link_libraries(ncnnoptimize PRIVATE ncnn)
if(NCNN_VULKAN)
target_link_libraries(ncnnoptimize PRIVATE ${Vulkan_LIBRARY})