This paper reports on preliminary steps to create an external plagiarism detection tool. I used the PAN-PC-11 data sets and extracted tf-idf scores of text documents and cosine similarity measures between source and suspicious documents to find text overlap. The model was able to successfully create vectors and measure the similarity metrics. https://www.markbroyard.com/best-catch-acure-ultra-hydrating-plant-ceramide-daily-facial-lotion-50ml-fashion-great-buy/