Content of the user web page is extracted and tokenized. TFIDF is done and found similarity value. 0.4(approx) is strongest.
kaushikdata / cosinesimofwebpage Goto Github PK
View Code? Open in Web Editor NEWContent of the user web pages is extracted using URL and tokenized. TFIDF is done and found similarity value. 0.4(approx) is strongest.