基于AC自动机的中文分词、脏词过滤

Discussion in '开发交流 | Development' started by jinnblue, Aug 3, 2016.

  1. jinnblue

    jinnblue Member

    Country:
    China
    May 18, 2015
    用Delphi Berlin 实现了一个Aho-Corasick(AC自动机)中文分词和脏词过滤;
    另外实现了AC自动机结合双数组的脏词过滤,词组较多(大于1W)的时候,初始化较久,待优化;
    放在GitHub,有兴趣的可以看看:
    Hidden Content:
    You must reply before you can see the hidden data contained here.
     
    solover, hellowy, c5soft and 7 others like this.
  2. 33

    33152811 Member

    Aug 17, 2016
    bucuo kanxia
     
  3. ja

    jasondelphi Member

    Country:
    China
    Nov 19, 2016
  4. xa

    xaccc Member

    Country:
    China
    May 24, 2015
  5. bj

    bjabc Member

    Country:
    China
    Jul 8, 2015
    厉害啊!
     
  6. yr

    yrry Member

    Country:
    Russian Federation
    Jun 27, 2015
  7. he

    hellowy Member

    Country:
    China
    Dec 11, 2015
  8. so

    solover Member

    Country:
    China
    Dec 24, 2015