Binary Imbalanced Learning: A Practical Approach in R

Introduction and motivation

Binary classification is arguably one of the simplest and most straightforward problems in machine learning. Usually we want to learn a model that predicts whether an instance belongs to a class or not. It has many practical applications, ranging from email spam detection to medical testing (determining whether a patient has a certain disease). Slightly more formally, the goal of binary classification is to learn a function f(x) that maps x (a vector of features for an instance/example) to a predicted binary outcome ŷ (0 or 1). Most classification algorithms, such as logistic regression, Naive Bayes and decision trees, …
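The mapping from a feature vector x to a predicted outcome ŷ described above can be sketched in a few lines of base R. This is only an illustration under stated assumptions: the data are simulated, the feature names `x1` and `x2`, the coefficients, and the 0.5 threshold are all made up for the example, and logistic regression is used simply because it is one of the algorithms the excerpt mentions.

```r
# Simulated data: feature names and coefficients are illustrative assumptions.
set.seed(42)
n  <- 200
x1 <- rnorm(n)
x2 <- rnorm(n)

# The true class probability depends on x1 and x2 through a logistic link.
p <- plogis(0.5 + 1.5 * x1 - 1.0 * x2)
y <- rbinom(n, size = 1, prob = p)
train <- data.frame(y = y, x1 = x1, x2 = x2)

# f(x): a logistic regression fitted with base R's glm().
fit <- glm(y ~ x1 + x2, data = train, family = binomial)

# Predicted probabilities, thresholded at 0.5 to obtain y_hat in {0, 1}.
prob_hat <- predict(fit, newdata = train, type = "response")
y_hat    <- as.integer(prob_hat > 0.5)

# Cross-tabulate predicted against actual classes.
table(predicted = y_hat, actual = y)
```

The threshold of 0.5 is a common default rather than a requirement; with imbalanced classes a different cut-off may well be more appropriate.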

Data Visualization With ggplot2: Understanding the Grammar and Practical Approach

Data visualization is one of the most important parts of your data science or data analytics journey. It is the visualization that attracts viewers to your work, impresses shareholders enough to invest, and persuades the authorities to give your work a positive review. But representing data correctly is not that simple: you need not only a solid foundation in visualization tools, but also to keep an eye on the variables you use, understand the relationships between them and, foremost, understand the visualization graphics that establish your findings. In this post I am going to cover …

Understand Basic to Advanced Data Structures Used in R to Use Them Efficiently

Data structures

You’ve probably used many (if not all) of them before, but you may not have thought deeply about how they are interrelated. In this brief overview, I’ll show you how they fit together as a whole. If you need more details, you can find them in R’s documentation. R’s base data structures can be organised by their dimensionality (1d, 2d, or nd) and whether they’re homogeneous (all contents must be of the same type) or heterogeneous (the contents can be of different types). This gives rise …
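The two axes mentioned above, dimensionality and homogeneity, can be illustrated with base R objects. The sketch below is one possible way to line them up; the variable names and values are arbitrary choices made for the example.

```r
# Homogeneous structures: every element has the same type.
v <- c(1.5, 2, 3)                    # 1d: atomic vector
m <- matrix(1:6, nrow = 2)           # 2d: matrix
a <- array(1:24, dim = c(2, 3, 4))   # nd: array

# Heterogeneous structures: elements may have different types.
l  <- list(1L, "a", TRUE)                            # 1d: list
df <- data.frame(id = 1:3, name = c("a", "b", "c"))  # 2d: data frame

# Dimensionality can be inspected with dim(); a plain vector has none.
dim(v)
dim(m)
dim(a)

# Mixing types in an atomic vector forces coercion to a single type,
# whereas a list keeps each element's own type.
mixed <- c(1, "a", TRUE)
typeof(mixed)
sapply(l, typeof)
```

Running `str()` on any of these objects is a quick way to see both its type and its dimensions at once.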