Gadepally Guest Lecture: "Data management tools to enable complex applications"
Time: 4 p.m.
Date: September 15, 2017
Location: 480 Dreese Labs, 2015 Neil Ave.
Invitees: Faculty, postdocs, students, staff

Dr. Vijay Gadepally will present a guest lecture entitled “Data management tools to enable complex applications.”

About the speaker: Dr. Vijay Gadepally is a senior member of the technical staff at MIT Lincoln Laboratory and CSAIL. His research focuses on high-performance computing, big data/IoT systems, security, analytics, and advanced database technologies. He holds a PhD in Electrical and Computer Engineering from The Ohio State University and a BTech degree in Electrical Engineering from the Indian Institute of Technology (IIT), Kanpur. In 2017, Vijay received the Early Career Technical Achievement Award at MIT Lincoln Laboratory and was named to AFCEA’s inaugural 40 under 40 list. In 2011, he received an Outstanding Graduate Student Award at The Ohio State University. He has also worked at Raytheon Company and Rensselaer Polytechnic Institute.

Abstract: Applying the latest and greatest machine learning algorithm to your data science problem is fun! Unfortunately, life rarely gives us datasets that are curated and well defined enough for direct use with the many great tools being developed by the wider community. One often spends the first few weeks (sometimes months) figuring out how to store the data, how to deal with inconsistencies in the dataset, and which algorithm (or set of algorithms) is best suited to the application. Larger, faster and messier datasets, such as those from IoT sensors, medical devices or autonomous vehicles, only compound these issues. These challenges, often referred to as the 3 V’s of Big Data (volume, velocity and variety), require new tools for data management and data cleaning/pre-processing.

In this talk I will describe a few tools developed at MIT’s Lincoln Laboratory and CSAIL to address these challenges. The first is the BigDAWG polystore system, which allows users to mix and match database technologies in order to support diverse data management operations. For example, a single application may keep pieces of a dataset in a relational database, a key-value store and an array database. This arrangement lets users develop complex analytics that leverage highly efficient underlying data stores. The second tool I will discuss is Graphulo, a toolbox that allows people to perform graph operations directly in key-value store databases such as Apache Accumulo. For both tools, I will describe performance and future research avenues.
