Friday, April 8, 2022

 Jaccard Coefficient

 RENANCY LYANA SARASWATY

15119430

3KA24

QUESTIONS

1. Given A={1,2,3,4}, B={1,2,4}, and C={1,2,4,5}, what are Jaccard(A,B), Jaccard(B,C), and Jaccard(A,C)?

2. Next, consider a query and document case. Suppose we have:

query: ideas of march

doc1: caesar died in march

doc2: the long march

Find the Jaccard coefficient between the query and doc1, and between the query and doc2.

3. Given 3 documents:

d1: “Jack London traveled to Oakland”

d2: “Jack London traveled to the city of Oakland”

d3: “Jack traveled from Oakland to London”

Using n-gram analysis with n=2 (bigrams), what are the values of the Jaccard coefficients J(d1,d2) and J(d1,d3)?

 

ANSWERS
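For question 1, the three coefficients follow directly from the set definition J(X,Y) = |X ∩ Y| / |X ∪ Y|. A minimal Python sketch of that calculation is shown here; the function name `jaccard` and the choice to return 0.0 for two empty sets are illustrative assumptions, not part of the original assignment.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |a ∩ b| / |a ∪ b|.

    Assumption: two empty sets give 0.0 (the assignment does not cover this case).
    """
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


A = {1, 2, 3, 4}
B = {1, 2, 4}
C = {1, 2, 4, 5}

print(jaccard(A, B))  # 3/4 = 0.75
print(jaccard(B, C))  # 3/4 = 0.75
print(jaccard(A, C))  # 3/5 = 0.6
```

In each pair the intersection is {1,2,4}; only the size of the union changes (4, 4, and 5 elements).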


 

2. Given: query: ideas of march

doc1: caesar died in march

doc2: the long march

Find: the Jaccard coefficient between the query and doc1, and between the query and doc2.
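One way to work this out is to treat the query and each document as a set of words and apply J(X,Y) = |X ∩ Y| / |X ∪ Y|. The sketch below assumes simple whitespace tokenization with lowercasing; the helper names `token_set` and `jaccard` are hypothetical, chosen only for this illustration.

```python
def token_set(text: str) -> set:
    """Split on whitespace and lowercase (an assumed, very simple tokenizer)."""
    return set(text.lower().split())


def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |a ∩ b| / |a ∪ b|; 0.0 if both sets are empty (assumption)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


query = token_set("ideas of march")
doc1 = token_set("caesar died in march")
doc2 = token_set("the long march")

# The only shared word in both cases is "march".
print(jaccard(query, doc1))  # 1/6, about 0.167
print(jaccard(query, doc2))  # 1/5 = 0.2
```

Under these assumptions the query shares only "march" with each document, so the union has 6 words for doc1 and 5 words for doc2, which makes doc2 the closer match.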


3. Jaccard(D1,D2)

| D1 | = 4 (Jack London, London traveled, traveled to, to Oakland)

| D2 | = 7 (Jack London, London traveled, traveled to, to the, the city, city of, of Oakland)

| D1 ∩ D2 | = 3 (Jack London, London traveled, traveled to)

| D1 ∪ D2 | = 4 + 7 - 3 = 8

| D1 ∩ D2 | / | D1 ∪ D2 | = 3/8 = 0.375

Jaccard(D1,D3)

| D1 | = 4 (Jack London, London traveled, traveled to, to Oakland)

| D3 | = 5 (Jack traveled, traveled from, from Oakland, Oakland to, to London)

| D1 ∩ D3 | = 0

| D1 ∪ D3 | = 4 + 5 - 0 = 9

| D1 ∩ D3 | / | D1 ∪ D3 | = 0/9 = 0
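The bigram calculation above can be checked with a short Python sketch. It assumes case-sensitive, whitespace-separated words and word-level bigrams; the helper names `bigrams` and `jaccard` are hypothetical and used only for this illustration.

```python
def bigrams(text: str) -> set:
    """Set of adjacent word pairs, e.g. ("Jack", "London")."""
    words = text.split()
    return set(zip(words, words[1:]))


def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |a ∩ b| / |a ∪ b|; 0.0 if both sets are empty (assumption)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


d1 = bigrams("Jack London traveled to Oakland")
d2 = bigrams("Jack London traveled to the city of Oakland")
d3 = bigrams("Jack traveled from Oakland to London")

print(len(d1), len(d2), len(d3))  # 4 7 5
print(jaccard(d1, d2))            # 3/8 = 0.375
print(jaccard(d1, d3))            # 0/9 = 0.0
```

Note that d1 and d3 use almost the same words, yet share no bigram, because a bigram preserves word order: ("to", "Oakland") in d1 is not the same pair as ("Oakland", "to") in d3.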

 

 
