Learn R and Python in Parallel
Maybe a major reason is an existential crisis.
Feedback from readers is another important reason. A few months ago I submitted a git repo containing three chapters of this book in PDF format to Hacker News, and to my surprise the repo got 500 stars in a week. I received a few emails expressing thanks and interest in more chapters. Since then, I have been working on this project constantly.
There has been considerable debate over choosing R vs. Python for Data Science. I started to learn Python when I was an undergraduate. At that time I had never heard of Data Science. A few years later I read an R script for the first time, and R then became my primary programming language for quite a while during my Ph.D. studies. I have also learned new programming languages as a hobby. Based on my limited knowledge and experience, both R and Python are great languages and are worth learning; so why not learn them together?
Besides the side-by-side comparison of the two popular languages used in Data Science, this book also focuses on the translation from mathematical models to code. In the book, readers can find from-scratch implementations and applications of some important algorithms, such as maximum likelihood estimation, inversion sampling, copula simulation, simulated annealing, bootstrapping, linear regression (lasso/ridge regression), logistic regression, gradient boosting trees, etc.
The code can be found at this git repo. If you have any ideas to share or find any errors in the book, please contact me directly via email at firstname.lastname@example.org.
A PDF version of this book will be available soon!