All the articles tagged under data

Building a Churn Model - an intern's perspective

With “Tech” and “Data” being the newest buzzwords, I was more excited about my summer internship as a Data Scientist at Viki. Coined as the “Sexiest Job of the 21st Century” by the Harvard Business Review, I was beyond eager to swim in data, and add as much value to the Viki Team.

I started of by getting introduced to the technologies- Hadoop, particularly AWS EMR, Hive, Tez, PostgreSQL, Redshift...

Continue →

Analytics Infrastructure Update - Building Our Own Hadoop Cluster

Analytics Processing Data Flow Diagram

Since our last blog post about the Viki analytics infrastructure, a lot of improvements has been implemented. With the old blog post being outdated, we’ve decided to make a follow-up blog post to share some of the changes.

Overview (or TL;DR)

We switched to a more flexible Hadoop solution. We were using Treasure Data, but we felt that we have outgrown the service and we wanted to do more. So...

Continue →

Data Warehouse and Analytics Infrastructure at Viki

Update: We have published a follow-up post after this, outlining how we build our own Hadoop cluster here

At Viki, we use data to power product, marketing and business decisions. We use an in-house analytics dashboard to expose all the data we collect to various teams through simple table and chart based reports. This allows them to monitor all our high level KPIs and metrics regularly. 


Continue →