Skip to content

Commit

Permalink
Merge pull request #116 from go-ego/range-pr
Browse files Browse the repository at this point in the history
Update README.md
  • Loading branch information
vcaesar authored Sep 19, 2021
2 parents c129d56 + 907e9ab commit 0c45dbf
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 6 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# gse

Go efficient text segmentation; support english, chinese, japanese and other. And supports with elasticsearch.
Go efficient NLP and text segmentation; support english, chinese, japanese and other. And supports with elasticsearch.

<!--<img align="right" src="https://raw.githubusercontent.com/go-ego/ego/master/logo.jpg">-->
<!--<a href="https://circleci.com/gh/go-ego/ego/tree/dev"><img src="https://img.shields.io/circleci/project/go-ego/ego/dev.svg" alt="Build Status"></a>-->
Expand All @@ -20,9 +20,9 @@ Go efficient text segmentation; support english, chinese, japanese and other. An
[简体中文](https://github.com/go-ego/gse/blob/master/README_zh.md)

<a href="https://github.com/go-ego/gse/blob/master/dictionary.go">Dictionary </a> with double array trie (Double-Array Trie) to achieve,
<a href="https://github.com/go-ego/gse/blob/master/segmenter.go">Sender </a> algorithm is the shortest path based on word frequency plus dynamic programming, and DAG and HMM algorithm word segmentation.
<a href="https://github.com/go-ego/gse/blob/master/segmenter.go">Segmenter</a> algorithm is the shortest path (based on word frequency and dynamic programming), and DAG and HMM algorithm word segmentation.

Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes, support user dictionary, POS tagging, run<a href="https://github.com/go-ego/gse/blob/master/server/server.go"> JSON RPC service</a>.
Support common, search engine, full mode, precise mode and HMM mode multiple word segmentation modes; support user and embed dictionary, POS tagging, analyze segment info, stop and trim, run<a href="https://github.com/go-ego/gse/blob/master/server/server.go"> JSON RPC service</a>.

Support HMM cut text use Viterbi algorithm.

Expand Down
6 changes: 3 additions & 3 deletions README_zh.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# [gse](https://github.com/go-ego/gse)

Go 语言高效分词, 支持英文、中文、日文等, 支持接入 elasticsearch
Go 语言高性能 NLP 和分词, 支持英文、中文、日文等, 支持接入 elasticsearch

<!--<img align="right" src="https://raw.githubusercontent.com/go-ego/ego/master/logo.jpg">-->
<!--<a href="https://circleci.com/gh/go-ego/ego/tree/dev"><img src="https://img.shields.io/circleci/project/go-ego/ego/dev.svg" alt="Build Status"></a>-->
Expand All @@ -21,9 +21,9 @@ Go 语言高效分词, 支持英文、中文、日文等, 支持接入 elasticse

支持 HMM 分词, 使用 viterbi 算法.

支持普通、搜索引擎、全模式、精确模式和 HMM 模式多种分词模式,支持用户词典、词性标注,可运行<a href="https://github.com/go-ego/gse/blob/master/server/server.go"> JSON RPC 服务</a>。
支持普通、搜索引擎、全模式、精确模式和 HMM 模式多种分词模式,支持用户词典、embed 词典、词性标注、停用词和整理分析分词,可运行<a href="https://github.com/go-ego/gse/blob/master/server/server.go"> JSON RPC 服务</a>。

分词速度<a href="https://github.com/go-ego/gse/blob/master/benchmark/benchmark.go">单线程</a> 9.2MB/s,<a href="https://github.com/go-ego/gse/blob/master/benchmark/goroutines/goroutines.go">goroutines 并发</a> 26.8MB/s. HMM 模式单线程分词速度 3.2MB/s.( 双核 4 线程 Macbook Pro)。
分词速度<a href="https://github.com/go-ego/gse/blob/master/benchmark/benchmark.go">单线程</a> 9.2MB/s,<a href="https://github.com/go-ego/gse/blob/master/benchmark/goroutines/goroutines.go">goroutines 并发</a> 26.8MB/s. HMM 模式单线程分词速度 3.2MB/s.(双核 4 线程 Macbook Pro)。

## Binding:

Expand Down

0 comments on commit 0c45dbf

Please sign in to comment.