====== Text mining ======

  * R package [[http://cran.r-project.org/web/packages/koRpus/koRpus.pdf|koRpus]]
  * R package [[http://cran.r-project.org/web/packages/tm/vignettes/tm.pdf|tm - text mining]]
  * RDM: [[http://www.rdatamining.com/examples/text-mining|R and Data Mining]] - twitter
  * [[http://www.r-bloggers.com/text-mining-in-r-automatic-categorization-of-wikipedia-articles/|Text mining in R – Automatic categorization of Wikipedia articles]]
  * [[http://onepager.togaware.com/OnePageR|OnePageR]] - A Survival Guide to Data Science with R; [[http://onepager.togaware.com/TextMiningO.pdf|Text mining]]
  * [[https://rstudio-pubs-static.s3.amazonaws.com/31867_8236987cf0a8444e962ccd2aec46d9c3.html|Basic Text Mining in R]]
  * [[https://deltadna.com/blog/text-mining-in-r-for-term-frequency/|Text mining in R for term frequency]]

===== Books =====

  * Peter Christen: Data matching: concepts and techniques for record linkage, entity resolution, and duplicate detection. Data-centric systems and applications. Springer 2012.
  * John Talburt: Entity Resolution and Information Quality. Elsevier 2011.
  *