Real-Time Kafka Streaming: Twitter Hashtags #what #is #happening ?

Hello, I’m back! I know, I know, my last blog was from exactly 3 years ago. But it’s never too late right? Streaming has always been an area that I was interested in, but I’ve never used it too much in the past. Today is the day! Let’s set up Kafka step by step, and … More Real-Time Kafka Streaming: Twitter Hashtags #what #is #happening ?

Find Twitter Influencers Through Social Network Analysis

Check out the DEMO R SHINY DASHBOARD! This is an open source project collaborated between NYC Data Science Academy and Fusion media (an ABC-Univision joint-venture). Project Team: –NYC Data Science Academy Fangzhou Cheng (PM, Data Science Fellow) Shu Yan (Data Science Fellow) Alex Singal (Data Science Fellow) –Fusion Noppanit Charassinvichai (Data Engineer) Introduction: This project designs … More Find Twitter Influencers Through Social Network Analysis

Theatre Event DBMS Retrofit Analysis and Design – Oracle 12c

Click to read the full report: Fangzhou Cheng Database Final Project Event Management Database Retrofit Analysis and Design Database Design and Management Class Final Project NYU MASY-GC-2500 Fall 2014 Instructor: Marc S. Paller Submitted By: Fangzhou Cheng Submitted On: October 20, 2014 Project source code: https://github.com/funjo/BPAC_database_retrofit I. Overview A. Background information Baruch Performing Arts Center (BPAC) is … More Theatre Event DBMS Retrofit Analysis and Design – Oracle 12c

The Most Dangerous Intersections in NYC – Interactive Data Visualization in R

Why visualize an accident map? As a Citi Bike annual member, I’ve always been proud of contributing to the poor 21% female rider ratio (source). But at the same time, I constantly got warnings from friends and family, “Streets in NYC are crazy! Biking is a bad idea!” How many biking accidents have actually happened compared to other vehicles? Where are … More The Most Dangerous Intersections in NYC – Interactive Data Visualization in R

Okcupid Scraper – Who is pickier? Who is lying? Men or Women?

Introduction: 40 million Americans indicated that they used online dating services at least once in their life (source), which got my attention — Who are these people? How do they behave online? Demographics analysis (age and location distribution), along with some psychological analysis (who are pickier? who are lying?) are included in this project. Analysis is … More Okcupid Scraper – Who is pickier? Who is lying? Men or Women?

NYC Oil Boilers – Detailed Fuel Consumption and Building Data

NYC Open Data Portal Working Through CSV Source Data Oil-burning boilers are one of the largest sources of air pollution in NYC. The purpose of the project is to see how complete this dataset is and to map the data in a geographic visualization. The original data is small-sized (~22MB, 8K rows), and appropriately enough, … More NYC Oil Boilers – Detailed Fuel Consumption and Building Data

Acme Gallery Responsive Website (WordPress)

Acme Gallery Website a demo responsive wordpress website I built for Application Architecture Design and Development class at NYU during my graduate study in Management Information Systems. The purpose was to showcase the look and feel of the website in response to our client Acme Gallery’s RFP(Request for Proposal). This demo site, a formal written proposal, … More Acme Gallery Responsive Website (WordPress)