Lexically Grounded Subword Segmentation (LEGROS) Word-embedding-based subword segmentation algorithm with bigrams! Building C++ code You can build the project with cmake by following the standard routine: mkdir build cd build cmake .. make