-
Label points based on the polygon they fall in
Problem Let’s say you have a dataset (dataframe) that has coordinates of locations where a certain species was observed take for example Striped Kingfisher species. The observations were done for a given Country take for example Kenya which has 47 Counties (second level adminstrative boundaries). You may want to know the total number of observa... Read More
-
Automating Gpm Data Download And Manipulation
Precipitation data is one of the highly sort satelite data owing to its contribution to agriculture, energy and climate related studies. The data can be obtained from gpm through google earth engine. A good example of its use is in climate/weather analyses/forecasts and this often times necessitates the need to access the data frequently or hav... Read More
-
Support Vector Regression using R
SVM(Support Vector Machine) Support Vector Machine is a set of supervised machine learning algorithms used to solve classification and regression problems. SVM’s can be further categorized into two types: SVR (Support vector regression) : for solving regression problems. SVC (Support vector classification): for solving classification prob... Read More
-
Too many outliers? - Winsorization
In preparing your data for machine learning you’ll often come across features with values that are detached from the rest, we call them outliers. Such values are a point of pain to most models as they are sensitive to outliers. Well, regression algorithms such as Robust regression, XGBoost Regressors and some more others are said to be ‘robust’... Read More
-
Analysing salaries for United States using R
The objective of this article is to answer below questions that touch on salaries in United States of America. The idea is to understand the various factors that determine differences in salaries for professionals. Which industry pays the most? How do the salaries of men and women compare? Does an increase in years of experience lead to ... Read More
-
Data cleaning
It a widely known fact that a data scientist spends more than 80% of a data science project time cleaning up and preparing data for analysis and modelling, as such, its indispensable for a data scientist to have a good understanding and practical skills on data cleaning and munging. To that effect, this article seeks to practically apply some fu... Read More
-
ALIVE(K) Missions Resource Hub
Hi, Welcome to ALIVE (K) Missions Resource Hub. ALIVE (K) is a movement of Seventh-Day Adventist students and young professionals living in view of eternity. ALIVE(K) aims to inspire, train and mobilize Seventh-Day Adventist young adults for effective involvement in evangelism, directly impacting the African Continent. The movement conducts mi... Read More
-
Deploying dockerized shiny apps on Heroku
Heroku is a platform as a service (PaaS) that enables developers to build, run, and operate applications entirely in the cloud. They offer free services where you can run up to 5 apps but if you verify your account with a credit card you can get more than 5. Get to the sign up page and create an account.After you’re done you can now follow the ... Read More
-
Introduction to shinyproxy
Shinyproxy is an open-source system that makes it possible to deploy dockerized applications. The beauty of deploying your application with shinyproxy is that it gives you enterprise features in an open-source context. Some of the gains of using shinyproxy system are: workspace isolation for each app session. you can have as many users as... Read More
-
Connecting docker container to mysql database
You have containerized your application and you could be wondering how to link it to a database (mysql for example), be it locally or on some remote server. Well, I hope this simple guide will be of help to you. Assuming you already have mysql installed (if not please check here), you will need to make some adjustments on configurations of user... Read More
-
Docker Introduction with R
Docker technology has become one of the highly used technologies in software development world, developers are now able to develop, replicate and ship containerized application(s). The system works in such a way that the container runs an isolated environment from the host machine. A closely related way of doing it would be through spinning a vi... Read More
-
Gentle Introduction to R Shiny
Shiny is an R framework that makes it easy to develop web applications without necessarily knowing HTML. The skill will come in handy in situations where you want to share your R model to people as an interactive dashboard or maybe perform an exploratory data analysis that has interactive visualizations on it and many other use cases. knitr:... Read More
-
Data visualization
Data visualization is a branch of descriptive statistics that helps us to get insights from data. With graphics, we can explore the data to try and unpack underlying patterns which might be hidden in the data, as such, not easily perceivable just by plainly looking at the data. After you’ve cleaned and wrangled your data, as a data scientist, y... Read More
-
Ikea furnitures EDA
I am loving the tidytuesday challenges cause of the way they stretch my thinking particularly on data visualization skills. Took some time to flex my EDA skills with the TidyTuesday challenge data from Ikea home furnishing company. There are quite a number of things to inspect but I’ll just have a look at a few. Data import ikea <- readr::r... Read More