lukegonsalves/entropy

entropy

entropy is a C++ tokenisation library that uses information theory to generate efficient tokenisation schemes. It currently provides Huffman and Shannon Byte-Pair encoding algorithms.
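To illustrate the idea behind Huffman coding, here is a minimal, self-contained sketch that computes Huffman code lengths from byte frequencies. It is not taken from entropy's source and does not reflect its API: the function name `huffman_code_lengths` and its signature are hypothetical and chosen only for this example.

```cpp
#include <cstddef>
#include <functional>
#include <map>
#include <queue>
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper (NOT part of entropy's API): returns the Huffman code
// length in bits for each distinct byte in `text`.
std::map<char, int> huffman_code_lengths(const std::string& text) {
    // Count how often each byte occurs.
    std::map<char, std::size_t> freq;
    for (char c : text) ++freq[c];

    std::map<char, int> lengths;
    if (freq.empty()) return lengths;
    if (freq.size() == 1) {
        // A single distinct symbol still needs one bit per occurrence.
        lengths[freq.begin()->first] = 1;
        return lengths;
    }

    // Each heap entry is (subtree weight, symbols contained in the subtree).
    typedef std::pair<std::size_t, std::vector<char> > Node;
    auto heavier = [](const Node& a, const Node& b) { return a.first > b.first; };
    std::priority_queue<Node, std::vector<Node>, decltype(heavier)> heap(heavier);

    for (const auto& kv : freq) {
        heap.push(Node(kv.second, std::vector<char>(1, kv.first)));
        lengths[kv.first] = 0;
    }

    // Repeatedly merge the two lightest subtrees. Every symbol inside a
    // merged subtree moves one level deeper, so its code grows by one bit.
    while (heap.size() > 1) {
        Node a = heap.top(); heap.pop();
        Node b = heap.top(); heap.pop();
        for (char s : a.second) ++lengths[s];
        for (char s : b.second) ++lengths[s];
        a.second.insert(a.second.end(), b.second.begin(), b.second.end());
        heap.push(Node(a.first + b.first, a.second));
    }
    return lengths;
}
```

For the input `"aaaabbc"`, this assigns 1 bit to `a` and 2 bits each to `b` and `c`, so frequent symbols receive shorter codes. Storing the symbol list in each heap entry keeps the sketch short; a production implementation would normally build an explicit tree instead.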

Build from source

Requires CMake >= 3.10.0

mkdir entropy/build
cd entropy/build
cmake ..
make


About

C++ tokenisation - efficient embedding