Welcome to the Capstone Project for Big Data! In collaboration with Splunk, a software company focused on analyzing machine-generated big data, you will build a Big Data ecosystem using tools and methods from the earlier courses in this specialization. First, you will choose an application area, such as retail, sports, or current events. Then, you will enrich the datasets and data models already used in this specialization with external datasets of your choice. After bringing in data from at least three distinct sources, you will build searches and/or dashboards that address the Capstone Project questions. In this project, you will use a post-process search to aggregate or otherwise transform the data and extract meaningful insights. Using visualization and communication techniques for Big Data, you will be able to carry out basic storytelling and model interpretation. If yours is among the top projects (whether Hadoop or Splunk), you will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership!
At the University of California San Diego, we prefer the path less traveled. And it has led us to remarkable new ways of seeing and making a difference in the world.
Recognized as one of the top 15 research universities worldwide, our culture of collaboration sparks discoveries that advance society and drive economic impact. Everything we do is dedicated to ensuring our students have the opportunity to become changemakers, equipped with the multidisciplinary tools needed to accelerate answers to our world’s most pressing issues.