Index ¦ Archives

CAPSTONE PROJECT : Factors that have more incidence in the breast cancer diagnosis after a regular mammogram.

Logo

Overview

Breast cancer is the most common cause of deaths from cancer among women in the United States.

According to the American Cancer Society statistics 2016, it is estimated that almost 1.7 million new cases of cancer will be diagnosed in 2016. Prostate cancer is the most common cancer ...


Airport Delays Analysis using Clusters

Logo

Overview

In this project, we will use three different datasets related to airport operations. These include a dataset detailing the arrival and departure delays/diversions by airport, a dataset that provides metrics related to arrivals and departures for each airport, and a dataset that details names and characteristics for each ...


IMDb API + Random Forests

Logo

Overview

This week we've learned about ensemble methods and APIs. We will acquire data from IMDb, and use the collected metrics to predict whether a movie is highly rated or no. We will produce a report detailing our findings including next steps recommendations.

Problem Statement

I have been hired ...


Project 5: Disaster Relief + Classification

Logo

Overview

This week we worked with remote databases, and more advanced topics for conducting logistic regression.

We are going to create, train and evaluate a logistic regression model for disaster analysis using AWS PostgreSQL instance via Python.

In this project, we'll be using data on passengers from the 1912 ...


Project 4: Web Scraping and Logistic Regression - Predict Salaries

Group Members: Peida Cai, Betsy Zimmermann, and Maria Pichardo

Overview

This week we worked with web scraping and logistic regression. In this project, we will practice two major skills. Collecting data by scraping a website and then building a binary predictor with Logistic Regression.

We are going to collect salary ...

© 2016 Maria Pichardo. Built using Pelican. Theme by Giulio Fidente on github.