Jaccard Coefficient
RENANCY LYANA SARASWATY
15119430
3KA24
QUESTIONS
1. Given A={1,2,3,4}, B={1,2,4}, and C={1,2,4,5}, what are Jaccard(A,B), Jaccard(B,C), and Jaccard(A,C)?
2. Next, a query and document case. Suppose we have:
query: ideas of march
doc1: caesar died in march
doc2: the long march
Find the Jaccard coefficient between the query and doc1, and between the query and doc2.
3. Given 3 documents:
d1: “Jack London traveled to Oakland”
d2: “Jack London traveled to the city of Oakland”
d3: “Jack traveled from Oakland to London”
The values of the Jaccard coefficients J(d1,d2) and J(d1,d3), using n-gram analysis with n=2 (bigrams), are:
ANSWERS
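For question 1, the Jaccard coefficient of two sets X and Y is | X ∩ Y | / | X ∪ Y |. With the sets given, A ∩ B = {1,2,4} and A ∪ B = {1,2,3,4}, so Jaccard(A,B) = 3/4; the same counting gives Jaccard(B,C) = 3/4 and Jaccard(A,C) = 3/5. A minimal sketch in Python to check this, where the function name `jaccard` and the handling of two empty sets are assumptions made for illustration:

```python
def jaccard(x, y):
    """Return |x ∩ y| / |x ∪ y| for two collections treated as sets."""
    x, y = set(x), set(y)
    union = x | y
    if not union:
        # Assumption: two empty sets are given similarity 0 to avoid dividing by zero.
        return 0.0
    return len(x & y) / len(union)


A = {1, 2, 3, 4}
B = {1, 2, 4}
C = {1, 2, 4, 5}

print(jaccard(A, B))  # 3/4 = 0.75
print(jaccard(B, C))  # 3/4 = 0.75
print(jaccard(A, C))  # 3/5 = 0.6
```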
2. Given: query: ideas of march
doc1: caesar died in march
doc2: the long march
Asked: the Jaccard coefficient between the query and doc1, and between the query and doc2.
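Treating each text as a set of words, the query shares only "march" with each document. The union of the query and doc1 has 6 words, and the union of the query and doc2 has 5 words, so the coefficients are 1/6 and 1/5. A minimal sketch in Python, assuming whitespace tokenization and lowercasing (the helper names `word_set` and `jaccard` are illustrative, not from the exercise):

```python
def word_set(text):
    """Split on whitespace and lowercase; an assumed, very simple tokenizer."""
    return set(text.lower().split())


def jaccard(x, y):
    """Return |x ∩ y| / |x ∪ y| for two sets."""
    union = x | y
    if not union:
        return 0.0  # assumption: empty inputs get similarity 0
    return len(x & y) / len(union)


query = word_set("ideas of march")
doc1 = word_set("caesar died in march")
doc2 = word_set("the long march")

print(jaccard(query, doc1))  # 1/6, about 0.167
print(jaccard(query, doc2))  # 1/5 = 0.2
```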
3. Jaccard(D1,D2)
D1: 4 bigrams (Jack London, London traveled, traveled to, to Oakland)
D2: 7 bigrams (Jack London, London traveled, traveled to, to the, the city, city of, of Oakland)
| D1 ∩ D2 | = 3
| D1 ∪ D2 | = 8
| D1 ∩ D2 | / | D1 ∪ D2 | = 3/8 = 0.375
Jaccard(D1,D3)
D1: 4 bigrams (Jack London, London traveled, traveled to, to Oakland)
D3: 5 bigrams (Jack traveled, traveled from, from Oakland, Oakland to, to London)
| D1 ∩ D3 | = 0
| D1 ∪ D3 | = 9
| D1 ∩ D3 | / | D1 ∪ D3 | = 0/9 = 0
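The bigram counts above can be checked with a short Python sketch. It assumes words are split on whitespace and compared case-sensitively, and the helper names `bigrams` and `jaccard` are chosen here for illustration:

```python
def bigrams(text):
    """Return the set of adjacent word pairs (word bigrams) in the text."""
    words = text.split()
    return set(zip(words, words[1:]))


def jaccard(x, y):
    """Return |x ∩ y| / |x ∪ y| for two sets."""
    union = x | y
    if not union:
        return 0.0  # assumption: empty inputs get similarity 0
    return len(x & y) / len(union)


d1 = bigrams("Jack London traveled to Oakland")
d2 = bigrams("Jack London traveled to the city of Oakland")
d3 = bigrams("Jack traveled from Oakland to London")

print(len(d1), len(d2), len(d3))  # 4 7 5
print(jaccard(d1, d2))            # 3/8 = 0.375
print(jaccard(d1, d3))            # 0/9 = 0.0
```

Although d1 and d3 contain almost the same words, they share no bigram, which is why the bigram-based coefficient drops to 0.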