stopwords.txt 368 B

1234567891011121314151617181920212223242526272829
  1. # 这是一个索引器忽略的单词列表,每行一个单词
  2. # 这个文件需要使用UNIX行结尾(单换行符)
  3. # 不需要包括短于3字符的词,这些都会被忽略
  4. # 这个列表是基于此改进的 http://www.ranks.nl/stopwords/
  5. about
  6. are
  7. and
  8. you
  9. your
  10. them
  11. their
  12. com
  13. for
  14. from
  15. into
  16. how
  17. that
  18. the
  19. this
  20. was
  21. what
  22. when
  23. where
  24. who
  25. will
  26. with
  27. und
  28. the
  29. www