160 likes | 209 Views
Explore two variants of Sketch Grammar - Classic and V. Benko's Approach for precise results and less noise in word sketches.
E N D
Russian Word Sketches Khokhlova Maria St.Petersburg State University Institute for Linguistic Studies khokhlova.marie@gmail.com
Russian Web Corpus, S. Sharoff; • 10 mln tokens (sample); • Russian National Corpus isn’t available in the Sketch Engine; • 2 Sketch grammars: • “Classic” grammar; • V. Benko’s grammar
Verb X/X Verb 2:[tag="V.*"] [tag!="Z"&tag!="SENT"]{0,2} 1:[tag!="SENT"&tag!="Z"&tag!="S.*"&tag!="I"] 1:[tag!="SENT"&tag!="Z"&tag!="S.*"&tag!="I"] [tag!=","&tag!="SENT"]{0,2} 2:[tag="V.*"]
“Classic” Approach: precise results, less noise. • V.Benko’s Approach: word sketches are generated for any word, important in the case of mistakes in corpora.