Links
Tutorials / ressources
Machine learning
- https://blog.heuritech.com/2016/04/15/knowledge-extraction-from-unstructured-texts/
- https://www.datascience.com/blog/intro-to-anomaly-detection-learn-data-science-tutorials
- http://www.datasciencecentral.com/profiles/blogs/a-tour-of-machine-learning-algorithms-1
- https://karpathy.github.io/2015/05/21/rnn-effectiveness/
- https://medium.com/@ageitgey/machine-learning-is-fun-80ea3ec3c471
- https://en.wikipedia.org/wiki/Generalized_additive_model
- https://unsupervisedmethods.com/my-curated-list-of-ai-and-machine-learning-resources-from-around-the-web-9a97823b8524
- http://colah.github.io/
- https://github.com/jtpio/nn-from-scratch/blob/master/nn-from-scratch.ipynb
- http://cs231n.github.io/convolutional-networks/#conv
- https://github.com/cjbayesian/toyGANs/blob/master/GANs%20toys.ipynb
Statistics
- http://stats.idre.ucla.edu/sas/dae/multivariate-regression-analysis/
- http://nbviewer.jupyter.org/github/CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-Hackers/blob/master/Prologue/Prologue.ipynb
- https://www.newton.ac.uk/files/seminar/20110809093010301-152792.pdf
- http://sites.stat.psu.edu/~sesa/stat504/Lecture/lec3_4up.pdf
- https://en.wikipedia.org/wiki/Cohen's_kappa
- http://andrewgelman.com/2015/09/04/p-values-and-statistical-practice-2/
- http://bactra.org/weblog/1111.html
- https://en.wikipedia.org/wiki/Family-wise_error_rate
- https://en.wikipedia.org/wiki/False_discovery_rate
- https://en.wikiversity.org/wiki/Analysis_of_variance/Types
- https://www.stata.com/support/faqs/statistics/stepwise-regression-problems/
- https://www.slideshare.net/charthur/graduate-econometrics-course-part-4-2017?ref=http://freakonometrics.hypotheses.org/
- http://www.bmj.com/content/311/7003/485
- https://en.wikipedia.org/wiki/Survival_analysis
- https://en.wikipedia.org/wiki/Rank_correlation#General_correlation_coefficient
- http://lesswrong.com/lw/ev3/causal_diagrams_and_causal_models/
- https://kevinbinz.com/2015/01/21/average-causal-effect/
- http://www.statisticsviews.com/details/tools/13ae5fb1aa6/A-list-of-leading-and-interesting-blogs-to-follow.html
- http://freakonometrics.hypotheses.org/
- http://guides.library.duke.edu/c.php?g=289707&p=1930856
Natural language processing
- https://nlp.stanford.edu/software/lex-parser.html
- http://www.llf.cnrs.fr/Gens/Abeille/French-Treebank-fr.php
- http://stanfordnlp.github.io/CoreNLP/api.html
- https://nlp.stanford.edu/nlp/javadoc/javanlp-3.5.0/index.html?edu/stanford/nlp/dcoref/Document.html
- https://medium.com/@joshdotai/a-curated-list-of-speech-and-natural-language-processing-resources-4d89f94c032a
- http://universaldependencies.org/u/dep/all.html
- http://universaldependencies.org/u/pos/all.html
- http://universaldependencies.org/u/feat/all.html
- http://www.linguist.univ-paris-diderot.fr/~mcandito/Publications/crabbecandi-taln2008-final.pdf
- https://nlp.stanford.edu/IR-book/html/htmledition/latent-semantic-indexing-1.html
- https://gist.github.com/ttezel/4138642
- http://universaldependencies.org/u/dep/all.html
- http://universaldependencies.org/u/pos/all.html
- http://universaldependencies.org/u/feat/all.html
- http://www.linguist.univ-paris-diderot.fr/~mcandito/Publications/crabbecandi-taln2008-final.pdf
- https://nlp.stanford.edu/IR-book/html/htmledition/latent-semantic-indexing-1.html
Cheat sheets
- https://blogs.sas.com/content/subconsciousmusings/2017/04/12/machine-learning-algorithm-use/?utm_source=LINKEDIN_COMPANY&utm_medium=social-sprinklr&utm_content=895245317
- https://startupsventurecapital.com/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
- https://www.analyticsvidhya.com/blog/2016/12/cheatsheet-scikit-learn-caret-package-for-python-r-respectively/?utm_content=buffer3140b&utm_medium=social&utm_source=linkedin.com&utm_campaign=buffer
- https://www.analyticsvidhya.com/blog/2015/09/full-cheatsheet-machine-learning-algorithms/
- http://blog.minitab.com/blog/adventures-in-statistics-2/choosing-between-a-nonparametric-test-and-a-parametric-test
- https://www.healthknowledge.org.uk/public-health-textbook/research-methods/1b-statistical-methods/parametric-nonparametric-tests
Online courses (stats mostly):
- https://onlinecourses.science.psu.edu/stat504/node/180
- https://onlinecourses.science.psu.edu/stat504/node/113
- https://onlinecourses.science.psu.edu/stat502/node/141
- https://onlinecourses.science.psu.edu/stat504/node/49
- https://onlinecourses.science.psu.edu/statprogram/programs
- http://www.stat.ncsu.edu/people/bloomfield/courses/ST784/slides.html
- http://www.stat.ncsu.edu/people/bloomfield/courses/ST731/slides.html
- http://www.stat.ncsu.edu/people/bloomfield/courses/ST732/slides.html
Free books (disclaimer I have not read them all)
- http://www.deeplearningbook.org/
- http://neuralnetworksanddeeplearning.com/index.html
- http://www.bmj.com/about-bmj/resources-readers/publications/statistics-square-one
- http://www.biostathandbook.com/index.html
Books lists
- http://www.analyticsvidhya.com/blog/2015/10/read-books-for-beginners-machine-learning-artificial-intelligence/
- https://github.com/josephmisiti/awesome-machine-learning/blob/master/books.md
- https://www.reddit.com/r/MachineLearning/comments/1jeawf/machine_learning_books/
- http://blog.hackerearth.com/13-free-self-study-books-mathematics-machine-learning-deep-learning
- https://www.linkedin.com/groups/2013423/2013423-6273712822404902916?midToken=AQEUBTmPW2biTQ&trk=eml-b2_anet_digest_of_digests-hero-12-view%7Ediscussion&trkEmail=eml-b2_anet_digest_of_digests-hero-12-view%7Ediscussion-null-8z01vs%7Ej4dyv9ga%7Ex9-null-communities%7Egroup%7Ediscussion&lipi=urn%3Ali%3Apage%3Aemail_b2_anet_digest_of_digests%3BV1hBrDhkRoOiftaK%2B14IYg%3D%3D
- https://math.stackexchange.com/questions/94827/books-that-every-student-needs-to-go-through/94839#94839
Software / libraries
Python
- http://pystruct.github.io/
- http://www.skulpt.org/
- http://mc-stan.org/
- https://pypi.python.org/pypi/weighted-levenshtein/0.1
- http://seaborn.pydata.org/index.html
- https://github.com/scikit-learn/scikit-learn/tree/master/sklearn
- http://scikit-learn.org/stable/documentation.html
- http://www.nltk.org/
R
- https://rstudio.github.io/leaflet/
- https://github.com/ropensci/tesseract
- https://cran.r-project.org/web/packages/OneR/vignettes/OneR.html
SAS
- http://blog.sasanalysis.com/2013/11/kernel-selection-in-proc-svm.html
- http://blog.sasanalysis.com/?view=sidebar
- http://blogs.sas.com/content/iml/2012/05/14/how-to-read-data-set-variables-into-sasiml-vectors.html
- http://support.sas.com/kb/24/470.html
- https://v8doc.sas.com/sashtml/lrcon/z0998889.htm
- http://analytics.ncsu.edu/sesug/2007/SD06.pdf
- https://groups.google.com/forum/#!topic/comp.soft-sys.sas/XWMBgadeP_Q
- http://www.sascommunity.org/wiki/Proc_spell
- http://support.sas.com/documentation/cdl/en/lrcon/69852/HTML/default/viewer.htm#p0eaz2e63dlj17n1i5z17z3h84vp.htm
- http://support.sas.com/documentation/cdl/en/connref/61908/HTML/default/viewer.htm#a001249955.htm
- http://stats.idre.ucla.edu/sas/whatstat/what-statistical-analysis-should-i-usestatistical-analyses-using-sas/
- http://support.sas.com/resources/papers/proceedings10/158-2010.pdf
- http://support.sas.com/resources/papers/proceedings10/028-2010.pdf
- http://blogs.sas.com/content/iml/2017/02/15/confidence-intervals-multinomial-proportions.html
- http://support.sas.com/resources/papers/proceedings09/192-2009.pdf
VBA / SQL / MS Excel / MS Access
- http://www.functionx.com/vbaccess/Lesson01.htm
- http://www.piuha.fi/excel-function-name-translation/index.php?page=english-french.html
- https://technet.microsoft.com/en-us/library/ms189826(v=sql.90).aspx
- https://msdn.microsoft.com/en-us/library/office/mt346046.aspx
- https://msdn.microsoft.com/es-es/library/cc221403(v=vs.95).aspx
Others
Miscellaneous
- https://intellipaat.com/interview-question/r-interview-questions/
- http://www.bzarg.com/p/how-a-kalman-filter-works-in-pictures/
- https://rjlipton.wordpress.com/
- https://divisbyzero.com/blog-division-by-zero/
- https://bost.ocks.org/mike/algorithms/
- http://norvig.com/spell-correct.html
- http://theyougen.blogspot.fr/2010/02/faster-spelling-corrector.html
- http://commonmark.org/help/
- https://gist.github.com/dupuy/1855764
- https://pparacch.github.io/2017/07/14/plotting_in_R_ggplot2_part_2.html
- https://www.analyticsvidhya.com/blog/2014/11/text-data-cleaning-steps-python/
- https://1916letteranalysis.wordpress.com/category/data-processing/bad-data/
Questions
- https://stackoverflow.com/questions/38705359/how-to-give-sns-clustermap-a-precomputed-distance-matrix
- http://stackoverflow.com/questions/101268/hidden-features-of-python#3267903
- https://stackoverflow.com/questions/12332975/installing-python-module-within-code
- https://stackoverflow.com/questions/3898572/what-is-the-standard-python-docstring-format/24385103#24385103
- https://www.reddit.com/r/statistics/comments/16k9z6/can_anyone_help_me_understand_when_to_use/
- https://stats.stackexchange.com/questions/16390/when-to-use-generalized-estimating-equations-vs-mixed-effects-models
- https://stats.stackexchange.com/questions/17331/what-is-the-difference-between-generalized-estimating-equations-and-glmm
- https://stats.stackexchange.com/questions/103039/missing-at-random-data-in-gee
- https://stats.stackexchange.com/questions/217774/normal-distribution-necessary-for-linear-mixed-effects-r
- http://support.minitab.com/en-us/minitab/17/topic-library/modeling-statistics/multivariate/principal-components-and-factor-analysis/differences-between-pca-and-factor-analysis/
- https://stats.stackexchange.com/questions/35319/what-is-the-relationship-between-independent-component-analysis-and-factor-analy
- https://www.quora.com/What-is-the-difference-between-PCA-and-ICA
- https://stats.stackexchange.com/questions/123063/is-there-any-good-reason-to-use-pca-instead-of-efa-also-can-pca-be-a-substitut/123136#123136
- https://stats.stackexchange.com/questions/14002/whats-the-difference-between-principal-component-analysis-and-multidimensional/14017#14017
- http://stats.stackexchange.com/questions/31/what-is-the-meaning-of-p-values-and-t-values-in-statistical-tests
- https://stats.stackexchange.com/questions/76875/what-is-the-difference-between-mcnemars-test-and-the-chi-squared-test-and-how/141450#141450
- https://stats.stackexchange.com/questions/200500/asa-discusses-limitations-of-p-values-what-are-the-alternatives
- https://stats.stackexchange.com/questions/201146/what-is-a-good-convincing-example-in-which-p-values-are-useful
- https://stats.stackexchange.com/questions/26450/why-does-a-95-confidence-interval-ci-not-imply-a-95-chance-of-containing-the
- http://www.differencebetween.net/miscellaneous/difference-between-anova-and-t-test/
- https://stats.stackexchange.com/questions/14226/given-the-power-of-computers-these-days-is-there-ever-a-reason-to-do-a-chi-squa/14230#14230
- https://stackoverflow.com/questions/34997134/random-forest-tuning-tree-depth-and-number-of-trees
- https://stats.stackexchange.com/questions/53240/practical-questions-on-tuning-random-forests
- https://stats.stackexchange.com/questions/103500/machine-learning-algorithms-to-handle-missing-data
- http://stats.stackexchange.com/questions/47771/what-is-the-intuition-behind-beta-distribution
- http://stackoverflow.com/questions/22937618/reference-what-does-this-regex-mean/22944075#22944075
- http://stackoverflow.com/questions/4044919/opposite-of-an-inner-join-query
- https://math.stackexchange.com/questions/3869/what-is-the-intuitive-relationship-between-svd-and-pca/3871#3871
- https://stackoverflow.com/questions/3870088/a-monad-is-just-a-monoid-in-the-category-of-endofunctors-whats-the-proble%e2%85%bf/3870310#3870310
- https://stackoverflow.com/questions/27086195/linear-index-upper-triangular-matrix
- https://math.stackexchange.com/questions/94827/books-that-every-student-needs-to-go-through/94839#94839