Portfolio of Projects and Competitions

These projects are a mix of my work in machine learning competitions or course projects in the Data Science Specialization. The projects cover a range of technical capabilities from simple regression to stacking multiple algorithms to make a single prediction. Go ahead and browse through them- I’d love to get your feedback!

Lending Club

Maximizing peer-to-peer lending returns using data science. The project is organized in three R packages:

  • LendingClub: Set of functions with bindings to the API enabling data access and transacting via R.
  • LendingClubModel: An exploratory analysis and a predictive model (Work in progress)
  • LendingClubAccess: Once the model is complete, the third component will be a Shiny App applying the insights to live results

Project Page

Random Acts of Pizza

Reddit hosted a program allowing people to ask others to buy them pizza. Not everyone will get free pizza. The objective of this competition was to develop a model and understand what features will lead to a successful request. I've wrapped this analysis into a package and used vignettes to document my workflow.

Project Page

Analyzing Credit Approval Decisions

A capstone project for grad school. Applying various analytical techniques in a case study of a lender's underwriting model with the intent of determining if credit is extended based on risk.

Project Page

LabCorp

Creating a database of service locations by scraping the LabCorp website. These locations were useful in portraying Quest Diagnostic's value proposition over its largest competitor.

Project Page

NJ Payroll

Course project to compare your salary to NJ state employees

Project Page

Shiny App

Northwind

Data tables useful for demonstrating SQL

Project Page

R package

Home Depot

Improving rhe quality of product search results

Github

Competition

NY Times

Predict whether NY Times blog posts will be successful

Project Page

Competition

Ortho

Formatting extracts from the SAP ledger system

Github

R Package

DGX

Formatting extracts from the data warehouse

Github

R Package

Moocs

Pedict which particiants will dropout from MOOCs

Github

Competition

Walmart

Predict sales after weather events

Github

Competition

Africa Soil

Infer key soil properties given other attributes

Github

Competition

Titanic

Competiion to predict which passengers survived

Github

Competition