字袋模型

出自維基百科,自由嘅百科全書
跳去導覽 跳去搵嘢

字袋模型英文bag-of-words model,BoW model)係自然語言處理資訊提取入面嘅一種做法,指嘅係將一段文字當做由啲組成嘅多重集,忽略文法甚至啲字嘅次序。

例如以下呢句嘢:

John likes to watch movies. Mary likes movies too.

用 BoW 方法表示嘅話會變成噉:

"John","likes","to","watch","movies","Mary","likes","movies","too"

睇埋[編輯]