Los Angeles

Data Con LA + imDATA 2022
in-Person at USC
August 13th - Data Con LA & August 14th - ImData

We look forward to seeing you again at Data Con LA 2022! Please note Data Con LA 2022 is scheduled for August 13th. & ImData is August 14th. See Data Con LA’s schedule here. See ImData’s schedule here.

If you would like to attend Data Con LA virtually for free, please sign up here

Register Now
Data Graphic

2022 Keynote Speakers

Ron Galperin

Ron Galperin

Controller at City of LA

Ron Galperin is the independently elected Controller of the City of Los Angeles. He serves as the watchdog for taxpayers at City Hall, making sure public dollars are spent efficiently and effectively. Galperin oversees a team that audits municipal departments and programs, manages payroll and spending, reports on the City’s finances, pursues fraud and waste, and works to create a more transparent, accountable, equitable and modern government for everyone. Galperin launched ControlPanel LA, the City’s first open data portal, in 2013. During the pandemic, he created the L.A. Equity Index to help address local barriers to opportunity and released a series of resource maps and financial dashboards to aid businesses and residents. Read more about him here.

Read Bio
Karthik Ramasamy

Karthik Ramasamy

Head of Streaming at Databricks

Karthik Ramasamy is the Head of Streaming at Databricks. Before joining Databricks, he was a Senior Director of Engineering, managing the Pulsar team at Splunk. Before Splunk, he was the co-founder and CEO of Streamlio that focused on building next-generation event processing infrastructure using Apache Pulsar and led the acquisition of Streamlio by Splunk. Before Streamlio, he was the engineering manager and technical lead for real-time infrastructure at Twitter where he co-created Twitter Heron, which was open sourced and used by several companies. He has two decades of experience working with companies such as Teradata, Greenplum and Juniper in their rapid growth stages building parallel databases, big data infrastructure and networking. He co-founded Locomatix, a company that specializes in real-time streaming processing on Hadoop and Cassandra using SQL, which was acquired by Twitter. Karthik has a Ph.D. in computer science from the University of Wisconsin, Madison, with a focus on big data and databases. During his college tenure, several of the research projects he participated in were later spun off as a company acquired by Teradata. Karthik is the author of several publications, patents and a popular book, Network Routing: Algorithms, Protocols and Architectures.

Read Bio
Peter Loo

Peter Loo

Acting Chief Information Officer, Los Angeles County

Peter Loo is the Acting Chief Information Officer of Los Angeles County. With over 35 years of public sector IT experience, he serves as the principal adviser to the CEO and provides strategy, leadership and guidance on enterprise IT initiatives, IT Governance and the delivery of information technology capabilities to support the County mission and Board priorities. Prior to joining the Los Angeles County, Mr. Loo was Director of Consulting Services at a large multi-national firm, where he was responsible for the successful delivery of multiple public-sector ERP and public health projects at State and local governments.

Read Bio
Soheil Katal

Soheil Katal

Chief Information Officer at Los Angeles Unified School District

Soheil Katal is an enterprising, innovative, solution-focused IT leader, experienced in enterprise architecture management and a proven successful project leader. Twenty years of experience in both the public and private sectors and well-rounded expertise in business insight and technology innovation catapulted him forward to serve as the Chief Information Officer (CIO) of the Los Angeles Unified School District (LAUSD), the nation’s second largest school district. As CIO, Soheil oversees one of the largest technology environments in the nation, with over 500,000 student and employee devices accessing a network of nearly 100,000 access points daily, 30,000 miles of cabling, and 16 petabytes of storage, 1,000 times the size of the Library of Congress. Soheil holds a Master of Business Administration degree in Information Technology from the Florida Institute of Technology and multiple technical industry certifications. He is a member of Phi Kappa Phi.

Read Bio
Lakshmi Sharma

Lakshmi Sharma

Chief Product and Strategy Officer at Fastly

Lakshmi Sharma has decades of experience driving strategy and transformation in engineering, business, and IT, leveraging people, process and technologies for successful business and cultural outcomes. She is currently the Chief Product and Chief Strategy Officer at Fastly, A Global Programmable Edge cloud platform designed to help Enterprises and developers extend their core cloud infrastructure, closer to users. Before Fastly, she was the Director of Product Management for Networking in Google Cloud, where she was responsible for end-to-end customer experience for Networking and Networking Solutions, crossing multiple industry verticals. She also led Products and Engineering at Cisco, Brocade, RIFT.io, Apcera, and Target. Lakshmi is passionate about building inclusive and transparent organizational cultures where everyone feels supported in their growth and empowered to make an impact in the organization. In personal life she believes education is the best way to bring equality. She spends time supporting and growing communities by creating opportunities that were not available to them because of access to equal education and resources to grow.

Read Bio
Eva Pereira

Eva Pereira

Chief Data Officer at City of Los Angeles

Eva Pereira is the Chief Data Officer, for the City of Los Angeles. Her team leads the delivery of a wide range of data projects around topics like racial equity in programs and services, census outreach, COVID response, and more. In addition to this, she manages the open data portal, as well as community and academic engagement programs that leverage the City’s open data.

Read Bio

2022 Sponsors

Host

2022 - University of Southern California USC

Gold

2022 - Boomi
2022 - Instaclustr
2022 - MongoDB
2022 - Starrocks

Silver

2022 - CalTech

Coffee

2022 - AWS

Partner

2022 - Archangel Technology Consultants, LLC
2022 - Cloudtrackr
2022 - DAJ
2022 - Data 4 Good
2022 - LOPSA
2022 - MountainTop Data
2022 - OpenSUSE
2022 - RMDS
Data Graphic

2022 Speakers

Aboli Khairnar

Aboli Khairnar

Data Scientist at Citi Ventures

Aboli is an Associate Data Scientist on Venture Innovation's Studio team, where she helps leverage the power of data science and machine learning to maximize their impact. She is passionate about using data for social good. She was previously a part of the CUPID program, where she worked as a Data Scientist Intern for Studio. Before joining the Studio team Aboli worked at Stantec, where she served as a Data Analyst analyzing and predicting flood frequency risk for rural catchments. She graduated with a master's degree in Applied Economics from the University of San Francisco and holds another master's degree in Water Resources Engineering from the Indian Institute of Technology Bombay.

Read Bio
Alison Bunce

Alison Bunce

Product Designer at Meta

Alison is Product Designer at Meta with a background in Data Science and Fashion. Alison has worked as a multidisciplinary designer building data visualization heavy products for brands such as Starbucks, SESAC, WarnerChappell, SONY/ATV and FINRA, and also worked as a Data Analyst for General Assembly. Committed to life long learning, Alison regularly takes bootcamp courses, but also teaches courses in Experience Design and Data Science and Data Analytics.

Read Bio
Anand Ranganathan

Anand Ranganathan

Chief AI Officer at Unscrambl

Anand Ranganathan is a co-founder and the Chief AI Officer at Unscrambl. He is leading Unscrambl's product development in several cutting-edge areas, including natural language processing, automated insights, data story-telling, and marketing optimization. Before joining Unscrambl, he was a Master Inventor and Research Scientist at IBM. He received his PhD in Computer Science from UIUC, and his BTech from the IIT-Madras. He also has over 70 academic journal and conference publications and 30 patent filings in his name.

Read Bio
Anil Inamdar

Anil Inamdar

VP & Head of Data Solutions at Instaclustr

Anil is the VP & Head of Data Solutions at Instaclustr by NetApp, which provides a managed platform for open source data-layer and developer workflow technologies. Anil has 20+ years of experience in data and analytics roles. Joining Instaclustr in 2019, he works with organizations to drive successful data-centric digital transformations via the right cultural, operational, architectural, and technological roadmaps. Prior to Instaclustr, he held data & analytics leadership roles at Dell EMC, Accenture, and Visa, among others. Anil lives and works in the Bay Area.

Read Bio
Antje Barth

Antje Barth

Principal Developer Advocate, AI/ML at AWS

Antje Barth is a Principal Developer Advocate for AI and Machine Learning at Amazon Web Services (AWS). She is co-author of the O'Reilly Book, "Data Science on AWS." Antje is also co-founder of the global "Data Science on AWS" Meetup. She frequently speaks at AI and Machine Learning conferences and meetups around the world, including the O'Reilly AI and Strata conferences. Besides ML/AI, Antje is passionate about helping developers leverage Big Data, container and Kubernetes platforms in the context of AI and Machine Learning.

Read Bio
Arif Ansari

Arif Ansari

Professor at University of Southern California

Arif Ansari is Professor at Marshall School of Business in University of Southern California, Los Angeles and the founder of Intuition Intelligence Inc. He is an expert in Big Data Business Analytics, Digital Marketing and Intelligent Systems. He holds multiple US and international patents Learning Systems. He has given many keynote speech on Business Analytics. He is an Advanced Predictive Analytics visionary and thought leader; he has developed a framework for Creative Analytics and has built advanced analytics engines for challenging problems. He has created real-time Viral Prediction Engine and Super Relevance Engine and custom-built Advanced Analytical Engines for the Retail, Finance and Media sectors. His guest editorial with Gerard Tellis and Yusuf Ansari (Intuition Intelligence) on Super Bowl 2022 Ad Virality Prediction and Insights is published in the World Advertising Research Center (WARC). His recent work with Wavemarker and Netflix won the ICOM Data Creativity Top Award in June 2022.

Read Bio
Ash Pahwa

Ash Pahwa

Instructor at Caltech

Ash Pahwa, Ph.D., is an educator, author, entrepreneur, and technology visionary with three decades of industry and academic experience. He specializes in the Machine Learning, Deep Learning, and Search Engine Optimization. Dr. Pahwa earned his doctorate in Computer Science from the Illinois Institute of Technology in Chicago. He is listed in Who's Who in the Frontiers of Science and Technology. He is also a Google Certified Analytics Consultant. In Industry, Dr. Pahwa has worked for General Electric, AT&T Bell Laboratories, Xerox Corporation, and Oracle. He founded CD-Gen, Inc. and DV Studio Technologies, LLC., which introduced successful products for CD-Recording (CDR) and MPEG encoding. His book, CD-Recordable Bible was published in English, Japanese, and German. In Academia, Dr. Pahwa teaches courses at California Institute of Technology (Pasadena) and the University of California system. Since 2008, he taught many courses at UC Irvine, UCLA, and UC San Diego.

Read Bio
Bob Newstadt

Bob Newstadt

Senior Director of Business Intelligence at HopSkipDrive

Bob Newstadt is a data expert and currently the Senior Director of Business Intelligence at HopSkipDrive, a transportation service for kids. Bob leads a team responsible for data science, analytics, data culture, data infrastructure, data governance, and data products. He is also a mentor with the SpringBoard School of Data. Bob has computer science degrees from CMU (Applied Math) and UCLA (Databases and AI).

Read Bio
Brandon Wong

Brandon Wong

Lead Software Engineer at Academy Of Motion Picture Arts and Sciences

As a Lead Software Engineer at the Academy of Motion Picture Arts and Sciences, Brandon is responsible for everything from software design to data analytics to infrastructure. He has an educational background in business analytics and finance. It was when he began working at Northrop Grumman, that he decided to officially turn to computer science full time. Approximately 8 years, 12 programming languages, 18 hackathons, and over +10000 GitHub commits later he has done software development in the following industries: real estate, aerospace, financial software, internet marketing, and currently film/entertainment. One of his true passions, in addition to working with technology, is teaching, inspiring, and leading others who are interested in pursuing careers in tech-related fields.

Read Bio
Chirag Katbamna

Chirag Katbamna

Senior Manager at Accenture

Chirag has extensive experience in Data and Analytics and has helped numerous companies implement best practice analytics platforms. He currently works with Accenture as a Senior Manager in their Strategy & Consulting practice with focus on Applied Intelligence.

Read Bio
Chris Fregly

Chris Fregly

Principal Developer Advocate, AI and Machine Learning at AWS

Chris Fregly is a Principal Engineer,  AI and Machine Learning at Amazon Web Services (AWS) based in San Francisco, California. He is co-author of the O'Reilly Book, "Data Science on AWS."

Read Bio
Christian Bourdeau

Christian Bourdeau

Analytics Manager at Activision | Blizzard

By day, Christian provides critical business insights for Activision by building optimized interactive dashboards, business processes, and data pipelines. By night, he hopes to inspire the next generation of Business Intelligence and Data Analysts through LinkedIn, Social Audio, and YouTube and podcast coming soon!

Read Bio
Chuck Chekuri

Chuck Chekuri

Co-Founder & CDO at togetherMDM

As a data professional, entrepreneur, and industry consultant, he helps companies to be more data-driven. That includes data management, data quality, data governance, reporting, and advanced analytics for all business units including finance, sales, marketing, operations, and client experience. Familiar with big data solutions on-prem and in the cloud. Hands-on expertise in machine learning, artificial intelligence, and blockchain solutions for data management. Each day, he mentors and coaches teams to produce analytics and insights that inspire and motivate data-driven decisions at scale. He is currently working on a stealth startup to transform master data governance with blockchain and smart contracts.

Read Bio
Curtis O’Dell

Curtis O’Dell

Global Director Data Integrity at Tricentis

Curtis O'Dell is the business product manager for the data integrity offering for Tricentis. He has over 30 years of experience, having built fraud and risk data solutions for the largest banks in the U.S. His testing background includes every platform from mainframe to mobile. He has worked with testing embedded systems for Lockheed Martin's F-35 and Medtronic's pacemakers as well as the largest financial and business systems for ExxonMobil, Nationwide Insurance, and WorldPay. His experience includes managing the Emerald Health Service Bureau for Ross Perot's Perot Systems and building risk data systems for some of the largest banks in the southwest. A University of Texas at Austin graduate in Geophysics, a graduate of the System Engineer Development (SED) program at Ross Perot's EDS (now HPE), and Harvard Business School Business Analytics certified, he brings a breadth of knowledge of data and how to achieve solid data quality.

Read Bio
David Kjerrumgaard

David Kjerrumgaard

Developer Advocate at StreamNative

David is a committer on the Apache Pulsar project, and also the author of "Pulsar in Action" and co-author of "Practical Hive". He currently serves as a Developer Advocate for StreamNative where he focuses on strengthening the Apache Pulsar community through education and evangelization. Prior to that he was a principal software engineer on the messaging team at Splunk, and Director of Solutions for two Big Data startups; Streamlio and Hortonworks.

Read Bio
David Sarabia

David Sarabia

Founder / CEO at inRecovery

David is a serial entrepreneur. He co-founded UPrinting at 22 which Insight Partners acquired in 2010. Betting we'd all shop on our phones in the future, he founded the first m-commerce platform built for touch, which he sold to Magento. His struggle with mental health and addiction led him to homelessness. The overdose of a friend led David to seek help and checked himself into treatment. There he had a vision for data-driven care and inRecovery was born, the "operating system" helping modernize the addiction care industry, increasing access to care, and helping democratize reintegration to give people a real chance to thrive after crisis.

Read Bio
Debu Sinha

Debu Sinha

Sr Specialist Solutions Architect - AI/ML at Databricks

Debu is a Senior Solutions Architect at Databricks focused on implementing/optimizing machine learning and deep learning capable pipelines at scale. Previously he has co-founded a real-time identity graph management and analytics company called Throtle onboarding. Before that, he founded a nonprofit org in India that aimed at increasing education access to remote parts of India using virtual classrooms. In his current role, he interacts and aligns strategically with the technical and business leadership of Databricks Enterprise customers. He is leveraging his strong technical background, love for public speaking, and effective communication with customers to understand their business and technical strategy and challenges. He regularly engages in architectural design and whiteboarding sessions with customers as they see him as a trusted advisor. As a Senior Solutions Architect at Lifion by ADP, Senior Engineer at V12 group, and Bank of America, he has spearheaded multiple projects involving streaming and machine learning capable pipeline creation and optimization. his passion for cloud computing, machine learning, and distributed systems began while working on his Master's research thesis on Machine Translation at Johns Hopkins University.

Read Bio
Dipanwita Das

Dipanwita Das

CEO & Co-Founder at Sorcero

Dipanwita Das is an award-winning technology entrepreneur and AI innovator. Prior to founding Sorcero, Dipanwita founded 42 Strategies and worked with the Gates Foundation, Bloomberg Philanthropies, and the International Red Cross on a Nobel Peace Prize-winning public health policy initiative that impacted billions of people around the globe. An Atlas Corps Fellow and later Board Member, she designed the Global Leadership Lab, training global leaders from 60+ countries. Dipanwita graduated from some of the world's top startup accelerators, including Y Combinator's Female Founders program and Plug & Play, and helped author 5 patented advances in biomedical AI.  

Read Bio
Don Malloy

Don Malloy

Chair JCC at OATH

Donald Malloy is a Security Consultant, Founder of the Symposium on Securing the IoT and Chairman of OATH, The Initiative for Open Authentication. He helps organizations manage their security programs with their IoT security, authentication and identity management solutions. He is currently the Director of Business Development for DualAuth, a security company with multi factor authentication. OATH is an industry alliance that has developed the MFA market and shifted it from proprietary systems to an open source standard based architecture promoting ubiquitous strong authentication used by most companies today. Mr. Malloy has more than 25 year's experience in the Security, Semiconductor and Payment industry. Don was responsible for developing the online authentication product line while at NagraID Security (Idemia) and prior to that he was Business Development and Marketing Manager for Secure Smart Card ICs for both NXP and Infineon Technologies. Don originally comes from Boston where he was educated and has M.S. level degrees in Organic Chemistry and an M.B.A. in Marketing. He resides in Southern California and in his spare time he enjoys hiking, camping and traveling around this beautiful world.

Read Bio
Elektra Grant

Elektra Grant

Lecturer at California State Polytechnic University-Pomona

Elektra has 20 years of experience as a sustainability professional. She first became involved with the intersection of arts and sustainability in 2008 through Arts Earth Partnership, which was responsible for assessing arts and cultural facilities pursuing the LA Green Business Program Certification. Through this joint venture, Elektra performed comprehensive sustainability assessments of LA landmarks such as LACMA, The Natural History Museum, the Dolby Theatre, and Skirball Cultural Center. She started teaching the first required course for the Sustainability Minor at Otis College of Art and Design soon afterward, and since then has developed, taught and mentored eight sustainability-themed, interdisciplinary college-level courses. Elektra has contributed to two other City of Los Angeles sustainability projects. She assisted in the implementation of the RecycLA program, which aims to reduce greenhouse gas emissions while diverting 90 percent of the City’s landfill waste by 2025. She currently is a board member of Westwood Greenway, Inc., which partners with LA Sanitation & Environment. The Westwood Greenway cleans stormwater through natural processes, while providing an island of native habitat for birds, butterflies and other animals next to one of Metro Expo Line’s busiest stops.  Elektra is a LEED Green Associate and a Certified Sustainable Building Advisor. She has a Master of Science in Regenerative Studies (Sustainability), and a Bachelor of Fine Arts in Film, Video and Animation. Elektra is from Los Angeles – when she was a public art muralist, she apprenticed with veteran Chicano muralists East Los Streetscapers, and contributed substantially to a Koreatown mural which has been replicated and is on permanent display at the Petersen Auto Museum.

Read Bio
Elijah Ben Izzy

Elijah Ben Izzy

Data Platform Engineer at Stitchfix

Elijah has always enjoyed working at the intersection of math and engineering. More recently, he has focused his career on building tools to make data scientists more productive. At Two Sigma, he was building infrastructure to help quantitative researchers efficiently turn ideas into production trading models. At Stitch Fix he is a founding member of the Model Lifecycle team — a team that focuses on streamlining the experience for data scientists to create and ship machine learning models. In his spare time, he enjoys geeking out about fractals, looking at antique maps, and playing jazz piano.

Read Bio
Emad Georgy

Emad Georgy

CTO at Georgy Technology Leadership

Named Entrepreneur Magazine's Top CTO to Watch in 2022, Emad is recognized globally as an execution-focused CTO, board member and powerful transformational leader. His passion is both developing durable architecture and developing leaders. He has led some of the world's largest digital transformations in financial services, retail, healthcare and gaming. Emad is the CEO of Georgy Technology Leadership and the creator of the first global standard for measuring Technology Health(www.technology.health), a standard implemented in thousands of organizations. He has held CTO positions at a number of enterprise brands including Experian. Emad has also contributed code to the Windows platform as well as a number of IBM's products. He was also one of the first 50 in the world to implement HBase in a production environment. Emad has been recognized for his leadership with numerous industry awards including the Computerworld IT 100 award, and has been featured on a number of podcasts about technology leadership. Emad has a passion for growing leaders in technology.

Read Bio
Eric Wilker

Eric Wilker

Head of Partnerships and Business Development at Amazon/ IMDB

Longtime Warner Bros. executive Eric Wilker of Wilker Media Group oversaw all operations and strategy for the Studio’s Worldwide Television Marketing unit, an award-winning, full-service in-house agency providing a wide range of advertising, marketing, promotions, digital media, data, publicity and communications solutions to the production and distribution divisions of the Warner Bros. Television Group. He led a team focused on global strategy as well as business and partner development. Currently, Wilker leads an advisory practice in the TMT sector with clients ranging from AT&T, Inc. to independent studios. Before joining the Studio, Wilker was an independent television producer, serving in a variety of roles on a number of series and specials. He holds a Bachelor of Arts from Skidmore College, an MBA from The University of Southern California and is a member of the Producers Guild of America.

Read Bio
Fadi Azhari

Fadi Azhari

VP of Marketing at StarRocks

From Sun Microsystems to AWS and several startups that have transformed markets, Fadi Azhari has been leading the global conversation around computing and data analytics for over two decades. Today, he continues in his role as a data analytics industry thought leader as StarRocks’ VP of Marketing. At StarRocks, Fadi oversees the organization’s international go-to-market strategy, helping to educate engineers and business leaders on how to transform their data into powerful innovations and working directly with some of the world’s largest enterprises to help them deliver real-time analytics to their data teams.

Read Bio
Frank Bell

Frank Bell

Snowflake SME at Accenture & CEO at ITS

Frank has been the Snowflake Data Superhero for 3 years. He is also a SnowPro, Top Snowflake SME at Accenture (part time). He is a Data Thought Leader focused on data driven business, data sharing, and migrating data systems to the cloud. He is the Founder and CEO of a highly profitable technology/data consulting business for 19 years which was sold in 2019. Our consulting business delivered over a billion dollars of market cap growth to our clients. Frank's focus in most recent years has been on building inventions and teams which enable the intersection of automation, AI, and data to enable automated data driven business business. Frank's teams have built both the Saas Powered By Snowflake Solution - Snoptimizer which optimizes cost, security, and performance on Snowflake as well as the educational and snowflake data system community at ITS Snowflake. He has Built Snowflake Users Group in LA. Also, Frank was the main author of the popular Snowflake Essentials book.

Read Bio
Garnie Bolling

Garnie Bolling

Senior Solutions Architect at Boomi

Garnie Bolling is a Senior Software Consultant who has been working with data and data-centric projects for over 20 years. He comes from a wide background of experiences from his early days in eCommerce, Master Data Management, Data Quality, Data Governance, and Metadata strategies. He has led projects and engagements as a business consultant and software architect. Currently, as part of Boomi, LLP. Garnie helps organizations navigate the challenges in discovering, understanding, and leveraging data that has gone untapped. His philosophy is that all IT projects deal with Data, so knowing your data, understanding how to leverage it, democratizing it, and properly governing is the path to success.

Read Bio
George Mansoor

George Mansoor

Chief Information Systems Officer at California State University

George is a higher education technology leader at the California State University (CSU). The CSU has 23 campuses and the Chancellor's Office with 43,000 faculty and staff, 412,000 students and is the largest public university in the United States. Leadership responsibility include strategic and operational services for system-wide administrative systems (Finance, Human Resources , Student) and system-wide data ops and reporting systems.

Read Bio
Graham Symmonds

Graham Symmonds

Co-founder & Chief Executive Officer at Pluvial.ai

As a leader in the technical, environmental, and utility fields, Graham has a solid reputation for technical excellence and superb analytical capabilities.  Forged from the leadership and systems analyses requirements as an Engineering Officer in the Royal Canadian Navy, Graham has developed and shaped organizations from the ground up, focusing on encouraging the inquisitive and investigative traits required to effectively and efficiently manage businesses. ​

Read Bio
Harrison Tang

Harrison Tang

CEO at Spokeo

Harrison is Chief Executive Officer and Co-founder of Spokeo. He started Spokeo with his college roommates Mike Daly (CTO) and Eric Liang (CIO) in 2006 in his parents basement in 2006. and serves as Spokeo's Chief Executive Officer. Harrison guides Spokeo's product vision to build an experienced team that helps the company realize its mission of making the world around us more transparent. Under his leadership, He Harrison has helped build and scale Spokeo ’s user base from zero to serving tens of millions of users. He is passionate about building products that help customers get their jobs done. Harrison earned a B.A.S. degree in Electrical Engineering and Economics and a M.S. degree in Electrical Engineering at Stanford University.

Read Bio
Hugo Bergmann

Hugo Bergmann

Product Marketing Manager Lyve Cloud and Data Services at Seagate

Hugo Bergmann has been working with data and storage for over two decades. He is a product management and marketing professional for IT storage products in private, hybrid or multi-cloud data centers and has deep experience in creating top notch horizontal and vertical storage solution content and go to market strategies. Hugo also has experience in leading customer-focused interdisciplinary teams with global and multicultural team members to increasingly-higher performance levels and aggressive business objectives. He is based in Germany.

Read Bio
Jagjit Dhaliwal

Jagjit Dhaliwal

VP, Global CIO Industry Leader at UiPath

Jagjit Dhaliwal has recently joined as VP, Global CIO – IT Industry Practice Leader at UiPath. In this role, he is responsible for assisting client CIOs in their automation strategy. He brings in 20+ years of technology experience transforming public entities and Fortune 500 companies in multiple industry domains across various geographies. In his last role, Jagjit was a Deputy Chief Information Officer at Los Angeles County and acted as a Strategic Advisor to Public Safety and Health Departments. He collaborated with Department CIOs to foster digital transformation through IT strategy planning, organization digital & data maturity assessment, portfolio rationalization, governance, technology modernization, and adoption of emerging technologies like AI, RPA, IoT & Blockchain. He led countywide Data Strategy as well as developed a regional strategy for closing the Digital Divide gaps in the Los Angeles region. Before this, he worked with Cognizant Technology Solutions & Tata Consulting Services bringing in years of consulting, delivery, and sales experience. Jagjit has an executive MBA degree from the Massachusetts Institute of Technology (MIT) Sloan School of Management and a Bachelor of Engineering in Electronics from Punjab University, India.

Read Bio
Jai Bansal

Jai Bansal

Senior Manager, Data Science at Aetna

Jai is a  seasoned data scientist with experience leading high performing teams and building analytical products in organizations ranging from scrappy startups to established multinationals. He has a proven track record of combining team management, technical expertise, and thought leadership. He is a technical expert in deep/machine learning, statistics, experiment design, and data visualization.

Read Bio
Jason Kang

Jason Kang

Software Engineer at NASA-JPL

Jason came from computer science background, with masters in computer science and data science. He has been with JPL for more than 10 years, mostly building web-based applications. More recently, he transitioned to a data engineer role that deals with Earth related data science projects. He spends most of his time developing data pipelines and patching/improving data analytic platform.

Read Bio
Jauher Zaidi

Jauher Zaidi

Founder, Chairman & CINO at Palmchip

Jauher is an award-winning tech entrepreneur, inventor and engineer with nearly 35 years experience. He is the chairman and CEO of Palmchip (CyberSecurity for Industrial IoT) , Netvinci (Oculus Metaverse), Moobila (Digital Marketing) where he oversees all Sales, Marketing, design, engineering and happiness. He founded Palmchip Corporation, a company that invented CoreFrame, the first System-on-Chip (SoC) architecture, which enables billions of wireless and storage devices today. He has been named twice as the "Top 20 visionary CEOs in the world" by the EE Times and was awarded the 2010 Terman Award for best Cloud Computing/Software as a Service product by the TechAmerica Foundation. He holds several technology patents having to do with CyberSecurity, SoC and on-chip interconnect technology. A leading expert in the area of SoC technologies, Jauher has participated on many panels and has written and presented a number of articles ranging from CyberSecurity, System-on-Chip (SoC), Artificial Intelligence (AI) and Machine Learning (ML) technologies. In 1996, Jauher left Quantum Corporation to start Palmchip. Before that, he led a team to develop the world's first SSD system-on-chip controller at Quantum. He also worked at Unisys, Amdahl and Samsung. Jauher is an experienced in running several global companies like Moobila Corporation and Palmchip Pakistan (Pvt). He is always looking to invest in new startups. Please send your startup ideas to [email protected]

Read Bio
Jay Kim

Jay Kim

Data Scientist at Golden State Water Company

Jay is a Highly efficient and results-oriented multi-tasking data scientist/engineer with strong quantitative skills, development experience and strong education background with a MSc (Imperial College London (World Rank within Top 10 QS)). He is a Responsible self-starter with demonstrated experience in statistical programming language (R, Python, SAS, Scala) and programming language python for API’s.

Read Bio
Jaysen Gillespie

Jaysen Gillespie

Head of Analytics and Data Science at RTB House

Jaysen is a Southern California analytics, marketing, product, revenue and data science executive overseeing all aspects of analytics and data science for RTB House, a leading digital marketing ad-tech company. He has 15+ years of experience in technology and analytics leadership roles. He has managed global teams both on a remote and in-office basis and specialize in finding ways to bring data to the entire organization. He is a big proponent of making data both useful and FUN!

Read Bio
Jeanne Holm

Jeanne Holm

Deputy Mayor at City of Los Angeles

Jeanne Holm is the Deputy Mayor for Budget and Innovation of the City of Los Angeles, addressing technology, equity, digital inclusion, and fiscal accountability through municipal broadband, participatory budgeting, and digital transformation of government services. She was the evangelist for open data for the U.S. Government under President Obama, the lead for Africa open data for the World Bank, and the Chief Knowledge Architect at NASA. She serves as an advisor for the U.N. on data and astronautics, is a Trustee of Claremont Graduate University, is on the Global Leadership Board for Time’s Up, and teaches at UCLA.

Read Bio
Jennison Asuncion

Jennison Asuncion

Co-Founder and Vice-Chair at GAAD

Jennison Asuncion co-founded the annual Global Accessibility Awareness Day (GAAD) event in 2012 alongside Joe Devon. Working in the digital accessibility industry since 2006, He is Currently LinkedIn’s Head of Accessibility Engineering Evangelism. Jennison's industry and community contributions have earned him recognition including being named in 2020 by Business Insider as "1 of 30 power players helping new CEO Ryan Roslansky run LinkedIn.

Read Bio
Jie Chen

Jie Chen

Manager Advisory at KPMG

Jie is certified multi cloud solution architect with her strength in both breath and depth of understanding hybrid cloud infrastructure and Software Development Life Cycle. She is specialized in leading enterprise modernization with innovative solutions. As advisory manager at KPMG, her experience in integrating Kafka data streaming platform through hybrid cloud strategies has helped her client overcome data challenges and outpace fast growing demand.

Read Bio
Joe Devon

Joe Devon

Founding partner at Diamond and Co-Founder of Global Accessibility Awareness Day

Joe is a serial entrepreneur who has been fostering the growth of the Los Angeles tech ecosystem since 2008. His experience includes co-founder at Diamond, a digital agency that builds accessible experiences, and Co-Founder and Chair of the Global Accessibility Awareness Day (GAAD) Foundation. In 2019 Joe and the Diamond team launched their first State of Accessibility Report. Joe has more than 20 years of development experience in high-profile projects spanning Digital Media, Search Engines, Performance Management; for Internet backbone providers, Investment Banks, Telcos, Big Pharma and Big Media. When Joe is not solving technology challenges for major organizations, you can find him on stage influencing every developer and corporation to build their digital products with inclusive design in mind.

Read Bio
Joe Long

Joe Long

President at Summit Capital _California

Joe is the president at Summit Capital _California

Read Bio
John Sweet

John Sweet

Sr. Director, Accessibility and Technology Compliance at Pluto TV

I help product development teams innovate and build captivating interfaces that work for everyone. Accessibility Engineer | 6+ years of experience with assistive technologies, WCAG compliance and accessible web development. Innovator | Driving new technological advancement for people with disabilities. People Leader | Developing accessibility professionals, leading a team of people with varying abilities, producing meaningful results. Telecommunications Inclusion | 4+ years in telecom specializing in digital video interfaces, audio description and CVAA law. Data-Driven Results | Leveraging user research to make interface design decisions - Master of Science in Human Factors Psychology (i.e., Human-Computer Interaction).

Read Bio
John Thorpe

John Thorpe

Head of Product at BreezeML

John Thorpe is the head of product at BreezeML, a startup focused on developing infrastructure support to democratize AI/ML. John recently graduated from UCLA with his PhD in Computer Science with a focus in operating systems, distributed systems, and ML systems. He has built his thesis on developing techniques that utilize the cloud to enable affordable training of resource intensive models.

Read Bio
Keith Camoosa

Keith Camoosa

Chief Addressability Officer at Dentsu

In his role, Keith drives proficiency across dentsu’s agencies to ensure each are coordinated in delivering on an addressable future for clients. He works to find innovative ways to connect first-party data to supply sources. He also partners with Merkle and dentsu product leaders to elevate existing solutions like M1’s Publisher Addressable Marketplace, DELTA (Data Enabled Linear TV Activation), and dentsu’s contextual engine. Keith has a diverse background developing and leading marketing technology, consumer insights, and data science functions on both the agency and marketer side. Keith most recently served as the Global Chief Data and Analytics Officer at Initiative. Before that, Keith was SVP, Data Insights & Operations at WarnerMedia, where he modernized Warner Brothers' marketing technology capabilities and led the development of the data infrastructure for the launch of HBO Max.  He also held senior insights and analytics roles at IPG Mediabrands, TNS, and Yahoo!.

Read Bio
Kirill Makharinsky

Kirill Makharinsky

Founder & CEO at Enki

Kirill is the Founder and CEO of Enki, the company that helps teams with adoption of business intelligence and data tools. Previously, he co-founded Emerging Travel Group (including the brands RateHawk & ZenHotels) a profitable OTA with net annual revenues of over $100M, and Quid, a leading research and analysis tool for strategy teams with revenues of over $50M. In addition to his entrepreneurial ventures, he is an angel investor in Calm, AngelList, Opencare, and more.

Read Bio
Laura Garcia

Laura Garcia

Director of Clinical Design at Found

Laura Garcia PhD. is a Clinical Psychologist with a strong passion for the development of tech platforms that can support individuals’ mental health and wellbeing. She has expertise in behavioral science, therapeutic content design, and the evaluation of innovative digital therapeutics. Her creative process is informed by direct experience by providing evidence-based mental health treatment and assessment. Laura is the Director of Clinical Design at Found and former Director of Research, Design, and Innovation at Applied VR. She earned her doctorate of psychology in clinical studies at the University of Southern California.

Read Bio
Linda Liu

Linda Liu

Head of Data Analytics and Data Science at HyreCar

Linda Liu has extensive experience helping a business build analytics and data science foundations as well as roadmaps that empower data-driven decisions and improve user experience across the organization. She is also a big advocate for cultivating a data centric culture that enables teams across the business to better utilize data / analysis and bring added value to the day to day work.

Read Bio
Mariana Danilovic

Mariana Danilovic

Managing Director at Infiom, LLC

Mariana is the founder of Infiom, a global consensus blockchain economy for creation of Web3 ventures using DeFi, NFTs, DAOs, and Metaverse principles. Prior to Infiom, she was the Managing Director of BitAngels, the first crypto angel network formed in 2013 that invested in the launch of Ethereum. She is an Advisor to SuperWorld, Open Therapeutics, Anode (Pkt), Monax, System9, EarnTV, SafeGram, BuddyDAO, Kirobo, and CryptoArt Agency. Mariana was EVP Business Development for NTN Buzztime, Inc. (AMEX:NTN) and Head of Business Development, Content Aggregation and Programming for MediaZone, a broadband video company launched and owned by NASPERS Internet (largest investor in Tencent). Mariana also founded and directed the Digital Media Incubator at KPMG LLP.  Portfolio companies raised angel investments from Michael Jackson, Beny Alagem, Al Copeland, IdeaLab, David Gefen, Brian Singer, Bob Daly, Jeff Pollack of Handprint Entertainment and John Gee, Executive Director of NASA Ames Research Incubator, and VC funding from Kleiner Perkins, Sequoia, Intel, HP, IBM, Microsoft, Dell, Panasonic, and Chase Capital Partners. Mariana served on the New Media Board and serves on the International Committee of the Producers Guild of America and is a member of the Television Academy.   Mariana served on the Board of the Tim Draper's Zone Ventures Club and on the Mayor Riordan's Digital Coast Roundtable.

Read Bio
Mark Jackson

Mark Jackson

Quantum Evangelist at Quantinuum

Dr. Mark Jackson is the Senior Quantum Evangelist at Quantinuum. He received his Ph.D. in Theoretical Physics from Columbia University. He then spent 10 years researching superstring theory and cosmology, co-authoring almost 40 technical articles. To promote the public understanding of science, he founded the science crowdfunding platform Fiat Physica and non-profit Science Partnership Fund. He is Adjunct Faculty at Singularity University and a Director of Astronomers Without Borders.

Read Bio
Mark Quinsland

Mark Quinsland

Sr. Field Engineer at Neo4j

Mark is a field engineer at Neo4j where he helps global financial organizations solve complex problems using graph algorithms.

Read Bio
Martin Devon

Martin Devon

VP Digital Transformation at Autonomous Health

Martin Devon is an entrepreneur working on improving health outcomes by using better data. From the early application in health information exchange to using data to improve mental health, Martin has learned how important privacy and security must be balanced against the power of data.

Read Bio
Melinda Thielbar

Melinda Thielbar

Data Science Practice Lead and Director of Data Science at Fidelity Investments

Melinda takes big ideas and makes them work. She is a researcher and statistical software developer with over twenty years of industry experience. Her career ranges from statistics education to developing algorithms for commercial software to Hollywood script consulting. She co-founded Research Triangle Analysts, a 501(c)(3) not-for-profit focused on education and training for people working in the analytics field in the Raleigh-Durham area she currently works as a Practice Lead in Fidelity Investments Workplace Investing.

Read Bio
Michael Melore

Michael Melore

Senior Cybersecurity Advisor at IBM

Michael is an IBM Senior Cyber Security Advisor, Certified Information Systems Security Professional, Author, frequent National public speaker and moderator, FBI InfraGard committeeman, and recognized subject matter expert in Security and Threat Intelligence, Identity Access Governance, and Authorization. He is a pioneer in Identity and Access Management, Access Governance, and Authentication. His past consulting roles include: Lead architect for many of the world's largest authentication and authorization infrastructures. This includes two of the 1st individual Billion user authentication infrastructures. He is the Founder, Moderator, and Group Secretary for 17 Regional CISO/Security Leader Round Table Chapters across the US/1200+ Security Executives. Champion, host and evangelist of regularly scheduled cyber range threat simulation exercises, cyber training and education.

Read Bio
Mike Limcaco

Mike Limcaco

Senior Analytics Specialist at Google

As a Sr. Analytics specialist with Google, Mike works with a variety of enterprise customers & partners in the Media & Entertainment industry - helping studios, publishers, brands drive innovation through the adoption of Smart Analytics services, Data Ops/Engineering best practices, and advanced AI/ML-based capabilities available through the Google Cloud Platform.

Read Bio
My (Angela) Chung

My (Angela) Chung

Data Enthusiast at San Jose State University

Angela is a data enthusiast who seeks insights from biomedical and clinical data for disease interception and prevention, so patients could be treated with appropriate therapeutic strategies to control disease development in advance.

Read Bio
Neal Fultz

Neal Fultz

Principal Consultant at njnm Consulting

Neal Fultz is a computational statistician and data scientist, with years of experience in developing predictive models and tools in topics ranging from lizard behavior to satellite tracking to tax credit forecasting. He's also an open-source enthusiast, with over 30 Python, R, and JavaScript packages to his credit, and a 2022 Google Summer of Code mentor.

Read Bio
Parker Lowrey

Parker Lowrey

SVP of Data at Whip Media

Read Bio
Peter Zaitsev

Peter Zaitsev

CEO at Percona

Peter Zaitsev is CEO and co-founder of Percona. As one of the foremost experts on Open Source strategy and databases optimization, Peter leveraged both his technical vision and entrepreneurial skills to grow Percona from a two-person shop to one of the most respected open source companies in the business. Peter is a co-author of High Performance MySQL: Optimization, Backups, and Replication, one of the most popular books on MySQL performance. Peter frequently speaks as an expert lecturer at Open Source, Databases and related conferences.

Read Bio
Preet Singh

Preet Singh

CEO at Enginelitix

Preet Singh is CEO at Enginelitix, which provides AI solutions to revenue-facing problems for Fortune 50 companies and startups. Previous experience includes head of AI at multiple startups and Microsoft AI. He studied Computer Science & Economics at the University of Rochester, and Algorithmic Economics and Artificial Intelligence at Northwestern University. His research publications include algorithm design and artificial intelligence.

Read Bio
Ravi Pillala

Ravi Pillala

Chief Data Architect & Distinguished Engineer at Intuit

Ravi is an Experienced Architect with a valuable blend of technology and business skills and a passion for innovation and building/enabling high performing teams. He is results-oriented technology leader that loves to architect and deliver cloud scale, highly secure, resilient, and overall world-class technology solutions. He is a Software and system architect that emphasizes the alignment of technology solutions with customer/developer needs to accelerate business outcomes.

Read Bio
Raz Nistor

Raz Nistor

Senior Director at Keyrus US

Razvan Nistor is a Senior Director and Data Scientist at Keyrus US. He likes to use computers to help people solve complex problems in practical ways. He also heads up the CPG/Retail practice which is specifically focused on delivering tactical AI solutions that address major problems in the industry related to supply chain, demand planning, pricing analytics, and promotional ROI.

Read Bio
Rebecca Friedman

Rebecca Friedman

Founder & President at Koru Strategies

Rebecca is driven to improve the business of government through innovation, enhanced communication and collaboration. She has 15 years of experience including consulting at a leading professional services firm, managing gov tech education events and strategic workshop delivery, and project management and execution. She launched Koru Strategies, a government technology business development advisory firm in 2020.

Read Bio
Russell Jurney

Russell Jurney

Founder at Graphlet AI

Russell works at the intersection of Natural Language Processing (NLP) and Graph Neural Networks (GNNs). He is a four-time O'Reilly author and proven leader of teams that build, ship and operate AI applications. He has extensive experience in all aspects of data science, data engineering, machine learning and ML operations that are part of building data-driven applications. He is an applied researcher and product manager with a broad range of skills from 17 years of experience building and shipping data-driven products. He is currently interested in representation learning, graph neural networks, algorithmic trading and source code search and Q&A but is open to other opportunities. He is a former LinkedIn, Hortonworks. He has founder and Startup CTO experience.

Read Bio
Scott Stroz

Scott Stroz

MySQL Developer Advocate at Oracle

Scott is a developer with over 20 years of experience in a variety of languages. In all those years, MySQl has remained the one constant is his stack. He has a passion for sharing what he has learned on his coding journey so others may learn from the mistakes he has made along the way.

Read Bio
Scott Tarlow

Scott Tarlow

Principal Applied Scientist at Hypergiant

Scott is an Principal Applied Scientist at Hypergiant

Read Bio
Seth Muthukaruppan

Seth Muthukaruppan

Consultant at Instaclustr

Seth Muthukaruppan is a Principal Consultant at Instaclustr. Seth has 25 years of software experience in the data storage and cloud industries for companies including Ericsson, Nokia, and Nasuni, among others. He specializes in data analytics, AI/ML, natural language processing, distributed systems, and clean architecture. He also has extensive experience with search technologies such as Elasticsearch, Solr, and OpenSearch.

Read Bio
Sig Narvaez

Sig Narvaez

Executive Solution Architect at MongoDB

Sig has worked as a cloud and data architect in engineering and consulting specializing in micro-service and polyglot persistence architectures. Now an Executive Solution Architect at MongoDB, Sig is helping media, entertainment & gaming organizations build scalable and reliable solutions, and has spoken at conferences like QCon and Game Developers Conference.

Read Bio
Sonny Khan

Sonny Khan

Senior Data Analytics Specialist at AWS

Sonny Khan is a specialist at AWS for Data Analytics. He has served as a founder, executive and board member at various small, and large companies. Over 24 years, Sonny has worked in IT spanning across Development, InfoSec, Operations, Data, and Sales. Sonny has worked with Financial Institutions, Mortgage Banks, Supply Chain, Consumer Packed Goods, AdTech, SaaS, Insurance, gaming, and other industry verticals. He brings a unique perspective on business transformation and cloud strategies. He is passionate about technology solutions. He is a Computer Engineer from Purdue University, working on his first book, and does standup comedy for fun. At AWS, Sonny is helping his customers realize their business objectives by adopting modern data strategies in the cloud.

Read Bio
Stephan G. Erberich, PhD

Stephan G. Erberich, PhD

Chief Data Officer & Director Biomedical Informatics at Children's Hospital Los Angeles

Stephan is a senior computer scientist and accomplished C-Level executive with in-depth technical and leadership experience. He consistently achieves success leading sizable data-driven initiatives, innovating award-winning system and data solutions, building engineering teams, and implementing regulatory compliance and data governance. He has a talent for creating transformative digital strategies and long-term sustainable results within several computer science capacities. He is an experienced communicator, educator, and entrepreneur with the ability to forge solid partnerships and build consensus across multiple organizational levels and disciplines.

Read Bio
Steve Rimar

Steve Rimar

Founder at Analytica Consulting

Steve Rimar is a data analytics expert, technology evangelist, entrepreneur, and engineer with over 20 years’ experience in the software engineering, business intelligence, and IT industries. Steve has earned multiple honors including being published in CNN, NBC, Investor’s Business Daily, InformationWeek, Accenture’s Annual Report, Qualcomm’s Report About People and holds two U.S. patents. Steve’s experience and industry recognition within the data analytics field provided him the opportunity to found Analytica Consulting in 2014.

Read Bio
Suman Debnath

Suman Debnath

Principal Developer Advocate(Data Engineering) at AWS

Suman Debnath is a Principal Developer Advocate(Data Engineering) at Amazon Web Services, primarily focusing on Data Engineering, Data Analysis and Machine Learning. He is passionate about large scale distributed systems and is a vivid fan of Python. His background is in storage performance and tool development, where he has developed various performance benchmarking and monitoring tools.

Read Bio
Suzan Oslin

Suzan Oslin

Experiential Director at UXXR Design

Suzan is a human-centered creative technology leader in the VR AR industry focused on building a regenerative future through systems thinking and innovation design methodologies. I orchestrate elegant solutions for complex technical problems by inspiring collaboration and fostering communication across roles and teams, as well as, building and managing relationships with external vendors and partners. Her success at leading award-winning XR projects is dependent on my many years of experience, ability to quickly assemble the right team and technology needed to address the problem, as well as mastery at prioritizing what it takes to demonstrate value within a short time frame with limited resources.

Read Bio
Tatiana Tikhonova

Tatiana Tikhonova

Data Scientist at Sense360/Medallia

Tatiana is a creative data professional with years of industry experience in Data Science, Financial Analytics, and Business Intelligence, living and working from New York City for the past 10+ years. Tatiana is part of an innovative data science team at Medallia, tuning robust pipelines and building sophisticated technology in order to support the Sense360 platform that transforms the world of market research analytics and empowers businesses with competitive intelligence, behavioral benchmarking, and 360-degree analytical insights. Tatiana's expertise in data is complemented by her background in finance, Master's in linguistics, and many certificates in various areas of IT. In her spare time, Tatiana contributes to the Data Science community by mentoring women in tech and wrangling data pro bono with equal enthusiasm. Driven by the passion to learn and contribute, she continues to pursue the never ending path to growth, developing her skills and sharing her knowledge.

Read Bio
Tom Henricksen

Tom Henricksen

Humans Skill Enabler at Code is Easy

Tom Henricksen is a problem-solving technology professional. He has worked in various roles in technology for over twenty years. Tom has learned how to solve challenging issues in technology and lead technical teams. He can help you develop those skills too!

Read Bio
Tracey Trench

Tracey Trench

Founder/CEO at T2 Content Ventures

Tracy is a skilled strategist and entrepreneur creating value at the intersection of content and tech. She is a thirty year entertainment industry veteran/key executive leading teams that built Dreamworks Animation/Shanghai, global Discovery content apps, and the AR kids/family product Amazon Glow. She is currently pressure-testing concepts for web3/metaverse games and education content.

Read Bio
Venus Banuelos

Venus Banuelos

CEO at IMO

Experienced CEO / Producer / PR Manager with a demonstrated history of working in the entertainment industry. Skilled Digital Strategist, Search Engine Optimization (SEO), Marketing Management, Event Management, and Media Relations. Strong arts and design professional with a None focused studies in Communication, Journalism, and Related Programs from Centennial HS and Benson Polytechnic HS.

Read Bio
Xavier Kochhar

Xavier Kochhar

COO and CFO at pocket.watch

Xavier Kochhar is a media and technology executive, entrepreneur, investor, and direct-to-consumer subject matter expert. He is best known as the founder and CEO of The Video Genome Project, a video personalization engine and user experience company that built the largest, broadest, and most granular structured database of video content, by developing a method of finding and creating deep, relevant and granular information on video content (film, tv, online video), programmatically. These granular pieces of altogether new metadata, or “genes”, were then used to provide hyper-personalized recommendations and targeted content to consumers. Because of its accuracy and simplicity in the video discovery process, the company became known as “the Google of video”. The Video Genome Project was acquired by Hulu to create a more powerful way to surface the right content to the right user at the right time, in both live and on-demand environments. Currently a senior executive within AT&T, Inc., Xavier was most recently the chief strategy and business development officer of WarnerMedia's Direct-to-Consumer division. He was previously the Managing Partner of Medialink, acq. by Ascential plc (LSE: ASCL), and has served in senior executive roles at The Walt Disney Company, the William Morris Agency (WME), and L.E.K. Consulting.

Read Bio
Xuanzi Han

Xuanzi Han

Senior Software Engineer at Monte Carlo

Xuanzi Han is a senior software engineer at Monte Carlo, where she develops backend systems to power the company's category creating data observability platform. Previously, she worked as a senior software engineer on Uber's Marketplace team, helping create the algorithms and software driving decision making on the Uber app. In her spare time, she loves hiking, traveling, and hanging out with her beagle, Data.

Read Bio
Zac Yung-Chun Liu

Zac Yung-Chun Liu

Head of AI Research at Andromeda 360 AI

Zac is a Head of AI at Andromeda 360 Inc, a startup building a new generation ML platform to empower data scientist to do end-to-end ML with ease. He also holds a Research Associate position at Stanford University. Zac has worked as lead data scientist and ML engineer in various companies and research institutions in the past 10 years, including Hypergiant, QuantumScape, Driscoll's, Google, Monterey Bay Aquarium, Smithsonian, and NASA. He has been leading R&D and MLOps efforts and building AI systems in multiple domains, such as manufacturing sensory, remote sensing and imaging, supply chain, bioscience, and enterprise cloud product development. He has been invited to present at top ML conferences, such as NeurIPS and TensorFlow World and has 6 peer-reviewed publications in applied ML. He also has 7 open source development in data-centric AI, computer vision, natural language processing, as well as image processing for NASA Cassini spacecraft.

Read Bio
Zia Khan

Zia Khan

Computer Systems Analyst and Data Scientist at LearningFuze

Zia has been advising and helping customers develop solutions for their business challenges using system analysis, data analytics, machine learning, natural language processing, and deep learning. He has worked with companies to implement recommender systems and NLP projects including chatbots. He has implemented solutions based on python machine learning libraries i.e. Sklearn, pandas, keras and tensor flow.

Read Bio
Zitao Liu

Zitao Liu

Head of Engineering, ThinkAcademy International at TAL Education Group

Zitao Liu is Head of Engineering, ThinkAcademy International at TAL Education Group (NYSE:TAL), one of the largest leading education and technology enterprises in China. His research is in the area of machine learning, and includes contributions in the areas of artificial intelligence in education, multimodal knowledge representation and user modeling. He has published his research in 70+ highly ranked conference proceedings, such as NeurIPS, AAAI, WWW, AIED, etc. and serves as the executive committee of the International AI in Education Society and top tier AI conference/workshop organizers/program committees. He has won numerous international AI/ML competitions including NeurIPS 2020 education challenge. He is a recipient of ACM Distinguished Speaker and Beijing Nova Program 2020. Before joining TAL, Zitao was a senior research scientist at Pinterest and received his Ph.D degree in Computer Science from University of Pittsburgh.

Read Bio

Panels

Accessibility and Data Science

Coming to terms with Data Privacy laws

Today’s Media and Entertainment Industry and Your Data Skills

moderated by Moderator

placeholder

Panelists

Schedule of Events

ImData’s schedule of events

8:30 am
9:00 am
9:30 am
10:00 am
Keynotes
Keynotes
Keynotes
8:30 am - 9:45 am
Keynotes
8:30 am - 9:45 am
Keynote - Ron Galperin - Controller at City of LA<br/> Keynote - Soheil Katal - Chief Information Officer at Los Angeles Unified School District<br/> Keynote - Peter Loo - Acting Chief Information Officer, Los Angeles County<br/> Keynote - Eva Pereira - Chief Data Officer at City of Los Angeles<br/> Keynote - Karthik Ramasamy - Head of Streaming at Databricks<br/> Keynote - Lakshmi Sharma - Chief Product and Strategy Officer at Fastly<br/>
Ron Galperin
Soheil Katal
Peter Loo
Eva Pereira
Karthik Ramasamy
Lakshmi Sharma
Controller at City of LA
Chief Information Officer at Los Angeles Unified School District
Acting Chief Information Officer, Los Angeles County
Chief Data Officer at City of Los Angeles
Head of Streaming at Databricks
Chief Product and Strategy Officer at Fastly
Keynotes
08/13/2022
8:30 AM - 9:45 AM
Bovard

Keynote - Ron Galperin - Controller at City of LA
Keynote - Soheil Katal - Chief Information Officer at Los Angeles Unified School District
Keynote - Peter Loo - Acting Chief Information Officer, Los Angeles County
Keynote - Eva Pereira - Chief Data Officer at City of Los Angeles
Keynote - Karthik Ramasamy - Head of Streaming at Databricks
Keynote - Lakshmi Sharma - Chief Product and Strategy Officer at Fastly

10 am
11 am
12 pm
1 pm
2 pm
3 pm
4 pm
5 pm
6 pm
7 pm
8 pm
AI/ ML/ Data Science
AI/ ML/ Data Science
AI Ethics
10:00 am - 10:40 am
By Melinda Thielbar, Data Science Practice Lead and Director of Data Science at Fidelity Investments
AI Ethics
Melinda Thielbar
Fidelity Investments
Data Science Practice Lead and Director of Data Science
AI/ ML/ Data Science
SGM 123

From corporations to governments to private individuals, most of the AI community has recognized the growing need to incorporate ethics into the development and maintenance of AI models. Much of the current discussion, though, is meant for leaders and managers. This talk is directed to data scientists, data engineers, ML Ops specialists, and anyone else who is responsible for the hands-on, day-to-day of work building, productionalizing, and maintaining AI models. We'll give a short overview of the business case for why technical AI expertise is critical to developing an AI Ethics strategy. Then we'll discuss the technical problems that cause AI models to behave unethically, how to detect problems at all phases of model development, and the tools and techniques that are available to support technical teams in Ethical AI development.

Sponsored - Making real-time analytics a reality for digital transformation
10:50 am - 11:30 am
By Fadi Azhari, VP of Marketing at StarRocks
Sponsored - Making real-time analytics a reality for digital transformation
Fadi Azhari
StarRocks
VP of Marketing
AI/ ML/ Data Science
THH 201

- Enterprises are facing an imperative to grow their business to gain competitive advantage at breakneck speed. They need to achieve that by adding new value services efficiently and effectively. - To achieve growth from these new services, enterprises need new insights instantly from their constantly changing data. - Unfortunately, current data infrastructure solutions offer sub-optimal solutions that leave customers wrestling with to achieve their business goals. Why is real-time analytics so difficult? - Data freshness and fast responsiveness are both important and present technical challenges of their own. - User-facing analytics and operational analytics mean supporting thousands of users simultaneously. - You have to do a lot of unnecessary de-normalized tables (de-normalization jobs) in streaming pipelines. It is very difficult to build and maintain. - You can't easily update the data in realtime to analyze business changes. StarRocks re-invents real-time analytics with the only platform uniquely designed for the next generation real-time Enterprise, unleashing the power of business intelligence to help organizations accelerate their digital transformation. StarRocks makes real-time analytics a reality with the fastest, easy-to-use analytics platform on the planet.

AutoDC + AutoML = your AI development superpower
11:40 am - 12:20 pm
By Zac Yung-Chun Liu, Head of AI Research at Andromeda 360 AI & Scott Tarlow, Principal Applied Scientist at Hypergiant
AutoDC + AutoML = your AI development superpower
Zac Yung-Chun Liu
Scott Tarlow
Andromeda 360 AI
Hypergiant
Head of AI Research
Principal Applied Scientist
AI/ ML/ Data Science
SGM 101

The availability of AutoML (automated machine learning) with publicly accessible pre-trained models enable domain experts to automatically build high-quality custom ML applications without much requirement for ML model construction knowledge, which greatly speeds up the ML model development. AutoML has been an essential piece in the model-centric approach in the data science community. ' AutoDC (automated data-centric processing), similar to the purpose of AutoML, is a newly developed open source tool that enables domain experts to automatically and systematically improve datasets by fixing incorrect labels, adding examples that represent edge cases, and applying data augmentation, without much coding requirement and manual process. ' Coming these two frameworks enable the domain experts to improve both dataset and model concurrently and iteratively. ' In this talk, we will showcase 3 data science use cases and examples, which demonstrates the effectiveness of these two frameworks combined and how it empowers domain experts who don't know ML coding to do AI development.

LUNCH
12:20 pm - 1:30 pm
LUNCH
AI/ ML/ Data Science

Open Source Large Knowledge Graph Factory
1:30 pm - 2:10 pm
By Russell Jurney, Founder at Graphlet AI
Open Source Large Knowledge Graph Factory
Russell Jurney
Graphlet AI
Founder
AI/ ML/ Data Science
SAL 101

The knowledge graph and graph database markets have long asked themselves: why aren't we larger? The vision of the semantic web was that many datasets could be cross-referenced between independent graph databases to map all knowledge on the web from myriad disparate datasets into one or more authoritative ontologies which could be accessed by writing SPARQL queries to work across knowledge graphs. The reality of dirty data made this vision impossible. Most time is spent cleaning data which isn't in the format you need to solve your business problems. Multiple datasets in different formats each have quirks. Deduplicate data using entity resolution is an unsolved problem for large graphs. Once you merge duplicate nodes and edges, you rarely have the edge types you need to make a problem easy to solve. It turns out the most likely type of edge in a knowledge graph that solves your problem easily is defined by the output of a Python program using the machine learning. For large graphs, this program needs to run on a horizontally scalable platform PySpark and extend rather than be isolated inside a graph databases. The quality of developer's experience is critical. In this talk I will review an approach to an Open Source Large Knowledge Graph Factory built on top of Spark that follows the ingest / build / refine / public / query model that open source big data is based upon.

Perfect Viral Ad prediction of Super bowl 2022 using Tease, Test, Tweak and Target method
3:10 pm - 3:50 pm
By Arif Ansari, Professor at University of Southern California
Perfect Viral Ad prediction of Super bowl 2022 using Tease, Test, Tweak and Target method
Arif Ansari
University of Southern California
Professor
AI/ ML/ Data Science
GFS 106

Super Bowl Ad cost $7 million and each year a few Super Bowl ads go viral. The traditional A/B testing does not predict virality. Some highly shared ones reach over 60 million organic views, which can be more valuable than views on TV. Not only are these voluntary, but they are typically without distraction, and win viewer engagement in the form of likes, comments, or shares. A Super Bowl ad that wins 69 million views on YouTube (e.g., Alexa Mind Reader) costs less than 10 cents per quality view! However, the challenge is triggering virality. We developed a method to predict virality and engineer virality into Ads. 1. Prof. Gerard J. Tellis and co-authors recommended that advertisers use YouTube to tease, test, and tweak (TTT) their ads to maximize sharing and viewing. 2022 saw that maxim put into practice. 2. We developed viral Ads prediction using two scientific models: a. Prof. Gerard Tellis et al.'s model for viral prediction b. Deep Learning viral prediction using social media effect 3. The model was able to identify all the top 15 Viral Ads it performed better than the traditional agencies. 4. New proposed method is Tease, Test, Tweak, Target and Spots Ad.

Democratizing AI Across Clouds: Low-Cost, Easy-to-Deploy Machine Learning
4:10 pm - 4:50 pm
By John Thorpe, Head of Product at BreezeML
Democratizing AI Across Clouds: Low-Cost, Easy-to-Deploy Machine Learning
John Thorpe
BreezeML/UCLA
Head of Product
AI/ ML/ Data Science
THH 201

Machine learning (especially deep learning) is becoming increasingly complex and expensive. Many companies build their core businesses (e.g., self-driving, credit card fraud detection, item recommendation, etc.) upon continuous model training and/or inferencing, which is typically performed with dozens or even hundreds of GPU machines on a (public or on-premise) cloud. While a cloud-based environment makes it possible for these jobs to dynamically scale with load changes (e.g., user requests), running these jobs under the cloud's pay-as-you-go pricing model incurs large monetary costs, which would rapidly grow with the model size/complexity, the size of datasets, and the number of users. BreezeML democratizes AI/ML by helping AI companies significantly increase their performance-per-dollar by making effective use of preemptible GPU instances. Rooted in years of research at UCLA and Princeton, BreezeML provides (1) a preemption-resilient software system that allows users to reliably run ML training/inference jobs on preemptible instances (such as spot instances) and (2) a virtual cloud interface that performs intelligent selection and scheduling of (spot and on-demand) instances to minimize the monetary costs with strong SLA guarantees. Currently, BreezeML provides two services: 1. An API server (http://windmill.breezeml.ai/apis/) that allows ML engineers to upload batch jobs for free trails. It also allows customers to use their own cloud (e.g., AWS) credential to log in and use BreezeML to run jobs under their own cloud configurations. 2. We provide a docker image of the Breeze runtime, which includes the Breeze-enhanced Pytorch/Tensorflow/XGBoost as well as a new K8S-based orchestration system that can be easily deployed in the user's local environment (compliant with the user's local security policies). Our runtime allows the user to (a) use cheap spot instances in the cloud or (b) sharing resources between (low-priority) training and (high-priority) inference jobs in their on-premise cluster, thereby significantly improving GPU resource utilization. Experiments across a wide range of vision, language, and classification models demonstrate that BreezeML improve the performance-per-dollar by an average of 3 times. Our approach also eliminates the need of resource over-provisioning in on-premise clusters by allowing (high-priority) inference jobs to safely preempt (low-priority) training jobs.

Transformers for NLP
5:00 pm - 5:40 pm
By Ash Pahwa, Instructor at Caltech
Transformers for NLP
Ash Pahwa
Caltech
Instructor
AI/ ML/ Data Science
THH 301

Transformer architecture was proposed by Google Brain in 2017 to process sequential data. Transformers can be used in Natural Language Processing (NLP) and Computer Vision applications. Transformer architecture is based on the concept of ‘Self-Attention’. Transformers replaced the RNN/LSTM architecture. The major advantages of Transformer architecture are that they are fast and bi-directional. The input text is fed into this architecture in parallel which allows faster processing. The leading Language models BERT (Bidirectional Encoder Representations from Transformers) and GPT (Generative Pre-trained Transformer), are built upon Transformer architecture. BERT was proposed by Google and GPT-1/2/3 was proposed by OpenAI. BERT Language Model is included in Google Search Engine. HuggingFace web portal provides many popular Transformers in different flavors. Transformer can be used for all Natural Language Processing (NLP) applications like sentiment analysis, translation, auto-completion, named entity recognition, automatic question- answering and many more. Transformers can also be used for generating artificial text, which is indistinguishable from text generated by humans. This talk will briefly cover the theory of Transformers. Next it will focus on how to fine tune the standard Transformer library (downloaded from Hugging Face portal) for a specific application.

Using Google trends data to build product recommendations
5:50 pm - 6:30 pm
By Mike Limcaco, Analytics Specialist / Customer Engineer at Google
Using Google trends data to build product recommendations
Mike Limcaco
Google
Analytics Specialist / Customer Engineer
AI/ ML/ Data Science
SGM 123

Measure trends in a particular topic or search term across Google Search across the US down to the city-level. Integrate these data signals into analytic pipelines to drive product, retail, media (video, audio, digital content) recommendations tailored to your audience segment. We'll discuss how Google unique datasets can be used with Google Cloud smart analytic services to process, enrich and surface the most relevant product or content that matches the ever-changing interests of your local customer segment.

NETWORKING SESSION
6:30 pm - 7:30 pm
ALUMNI PARK
NETWORKING SESSION
AI/ ML/ Data Science

BI/ Reporting/ Business Use Cases
BI/ Reporting/ Business Use Cases
Demystifying the Art of Business Intelligence and Data Analytics
10:00 am - 10:40 am
By Brandon Wong, Lead Software Engineer at Academy of Motion Picture Arts and Sciences
Demystifying the Art of Business Intelligence and Data Analytics
Brandon Wong
Academy of Motion Picture Arts and Sciences
Lead Software Engineer
BI/ Reporting/ Business Use Cases
SGM 101

Business Intelligence is a technology-driven process that analyzes data and forms conclusions to help assist workers to make informed business decisions. From collecting to cleaning, to morphing, to displaying we will address the pain points, tips, and tricks on how to navigate this process of converting data from raw material to a final product. You'll learn: From a high level, the process of bringing data from the "back" to the "front". Tools and best practices for cleaning and displaying data. Understanding the foundations of business intelligence to better execute on objectives. The various ways of displaying data depends on circumstance.

Early cancer detection using higher-order genome architecture and chromatin interactions
11:40 am - 12:20 pm
By My (Angela) Chung, Data Enthusiast at San Jose State University
Early cancer detection using higher-order genome architecture and chromatin interactions
My (Angela) Chung
San Jose State University
Data Enthusiast
BI/ Reporting/ Business Use Cases
SAL 101

Cancer is a complex disease which requires interactions between cell-intrinsic alterations and tumor microenvironment. The connection between epigenetics and genomic structure plays a key role in chromatin interactions and enhancer-promoter communications for transcriptional activities. Alterations of these components in oncogenic signaling pathway potentially cause cancer cell-intrinsic changes and inappropriate instructions to normal cell cycles, leading to abnormal cell growth. ' Topologically associating domains (TADs) and A/B compartments are the main structures of higher-order chromatin structure. These contact domains, chromatin states, super-enhancers, and histone modifications together regulate transcription and gene expression for normal/abnormal cell cycles. ' Several bioinformatics tools were utilized ' FANC for processing raw FASTQ data to Hi-C contact matrices, JuicerTools for obtaining the locations of contact domains on the entire genome, and CoolBox for visualizing chromatin contacts in different cell lines. ' High-resolution chromatin contacts showed dynamic interactions among chromosomal regions in different cell lines. ' Qualitative and quantitative features were comprehensively engineered from 3D chromatin folding and epigenetic regulators using available packages (scikit learn, pytorch, pandas, numpy, matplotlib, etc.). ' XGBoost multi-class classifier achieved the highest accuracy of 80.90% in classifying normal and cancer cell lines based on chromatin interactions, followed by Random Forest at 73.76% and TabNet classifier at 70.00%.

LUNCH
12:20 pm - 1:30 pm
LUNCH
AI/ ML/ Data Science

Embedding medical journeys with machine learning to improve member health at Aetna
1:30 pm - 2:10 pm
By Jai Bansal, Senior Manager, Data Science at Aetna
Embedding medical journeys with machine learning to improve member health at Aetna
Jai Bansal
Aetna
Senior Manager, Data Science
BI/ Reporting/ Business Use Cases
GFS 106

This talk describes an internal data product called Member Embeddings that facilitates modeling of member medical journeys with machine learning. Medical claims are the key data source we use to understand health journeys at Aetna. Claims are the data artifacts that result from our members' interactions with the healthcare system. Claims contain data like the amount the provider billed, the place of service, and provider specialty. The primary medical information in a claim is represented in codes that indicate the diagnoses, procedures, or drugs for which a member was billed. These codes give us a semi-structured view into the medical reason for each claim and so contain rich information about members' health journeys. However, since the codes themselves are categorical and high-dimensional (>10K cardinality), it's challenging to extract insight or predictive power directly from the raw codes on a claim. To transform claim codes into a more useful format for machine learning, we turned to the concept of embeddings. Word embeddings are widely used in natural language processing to provide numeric vector representations of individual words. We use a similar approach with our claims data. We treat each claim code as a word or token and use embedding algorithms to learn lower-dimensional vector representations that preserve the original high-dimensional semantic meaning. This process converts the categorical features into dense numeric representations. In our case, we use sequences of anonymized member claim diagnosis, procedure, and drug codes as training data. We tested a variety of algorithms to learn embeddings for each type of claim code. We found that the trained embeddings showed relationships between codes that were reasonable from the point of view of subject matter experts. In addition, using the embeddings to predict future healthcare-related events outperformed other basic features, making this tool an easy way to improve predictive model performance and save data scientist time.

How bridging data with design influences today's data products
3:10 pm - 3:50 pm
By Alison Bunce, Product Designer at Meta
How bridging data with design influences today's data products
Alison Bunce
Meta
Product Designer
BI/ Reporting/ Business Use Cases
THH 201

Tech companies are making data products to empower us to work, explore, enjoy and create beyond what we can do now. With new Machine Learning based products paves new ways to capture, understand and communicate data. This drives us to question the tech product landscape. Namely, how these major tech companies have used the practices/capabilities from data science and usability mythologies from experience design to bring value to its users. Last year I spoke about how data and design can learn from each other, and this year, we'll reference today's real world examples to discuss how bridging data science with experience design influences data products. How companies are using design and data to introduce new products? How are methodologies in data science and experience design aiding in data product challenges? How might data collection shape product innovation? How might this affect privacy and security? Could evolving data products influence our behavior?

How to Build a Data-Informed Corporate Culture
4:10 pm - 4:50 pm
By Kirill Makharinsky, Founder & CEO at Enki
How to Build a Data-Informed Corporate Culture
Kirill Makharinsky
Enki
Founder & CEO
BI/ Reporting/ Business Use Cases
THH 301

After interviewing companies like AirBnB, Stripe, Looker, and Revolut, Kirill learned that what differentiates the companies that have grown quickly and successfully is the quality of their data culture. Better data skill sets are not only nice to have, they offer a fundamental change within any company, and every metric is impacted positively when you can work with data. Kirill can share his five steps to implementing a strong data culture within any industry below: Aim for everyone to independently access data Don't rely solely on BI tools Level the playing field Automate what you can Bottom-up and top-down

Real world consumer segmentation
5:00 pm - 5:40 pm
By Jaysen Gillespie, Head of Analytics and Data Science at RTB House
Real world consumer segmentation
Jaysen Gillespie
RTB House
Head of Analytics and Data Science
BI/ Reporting/ Business Use Cases
SGM 123

1. Shopkick has over 30M downloads, but the userbase is very heterogeneous. Anecdotal evidence indicated a wide variety of users for whom the app holds long-term appeal. 2. Marketing and other teams challenged Analytics to get beyond basic summary statistics and develop a holistic segmentation of the userbase. 3. Shopkick's data science team used SQL and python to gather data, clean data, and then perform a data-driven segmentation using a k-means algorithm. 4. Interpreting the results is more work -- and more fun -- than running the algo itself. We'll discuss how we transform from ""segment 1"", ""segment 2"", etc. to something that non-analytics users (Marketing, Operations, etc.) could actually benefit from. 5. So what? How did team across Shopkick change their approach given what Analytics had discovered.

Self-Service Success and Data Products
5:50 pm - 6:30 pm
By Chirag Katbamna, Senior Manager at Accenture
Self-Service Success and Data Products
Chirag Katbamna
Accenture
Senior Manager
BI/ Reporting/ Business Use Cases
SGM 101

We have grown past the traditional reporting off of centralized EDW data store. Each department wants to be empowered to do their own analytics, but this creates new challenges in areas of governance, security, access and monitoring. How do we do this the right way? - Explore the concept of Data Mesh - Explore what is Data Product - Explore how to implement successful Self-Service - Pushing the limits of new capabilities

NETWORKING SESSION
6:30 pm - 7:30 pm
ALUMNI PARK
NETWORKING SESSION
AI/ ML/ Data Science

Data 4 Good
Data 4 Good
Women in Data
10:00 am - 10:40 am
By Rebecca Friedman, Founder & President at Koru Strategies
Women in Data
Rebecca Friedman
Koru Strategies
Founder & President
Data 4 Good
THH 201

While great gender parity strides have been made in fields like web development and user experience (UX) design, data science "" relatively newer concentration"" still has a ways to go. Among technical education fields, data science is shown to have the lowest representation of female students, at just 35.3%. That lack of diversity is a serious issue. AI algorithms are susceptible to bias, so building them requires a team that includes a wide range of views and experiences. This session will focus on how to build an organization that supports the recruitment, mentorship, and career development of women in data.

Sponsored - Finding true purpose after falling to addiction, and inspiration to transform a broken healthcare system
10:50 am - 11:30 am
By David Sarabia, Founder/ CEO at inRecovery & Sig Narvaez, Executive Solution Architect at MongoDB
Sponsored - Finding true purpose after falling to addiction, and inspiration to transform a broken healthcare system
David Sarabia
Sig Narvaez
inRecovery
MongoDB
Founder/ CEO
Executive Solution Architect
Data 4 Good
SAL 101

As a bullied kid, I found refuge in computers and taught myself to code at 8. By 26, I had two successful tech exits and moved to NYC. A weekend party habit led to daily drug use and a spiral to heroin and homelessness. In 2016, after a friend�s overdose woke me up. I checked myself into rehab and quickly realized I was there for a bigger purpose. Healthcare is very broken. From legacy systems, inefficiencies, and poor customer experience. What if we could dramatically improve care models by leveraging data, personalizing treatment, and creating beautiful patient experiences? Ever worked in an industry that felt antiquated? Learn how we use MongoDB to transform addiction care and help people thrive in life!

Human Capital Growth Analytics
11:40 am - 12:20 pm
By Aboli Khairnar, Data Scientist at Citi Ventures Inc.
Human Capital Growth Analytics
Aboli Khairnar
Citi Ventures Inc.
Data Scientist
Data 4 Good
THH 301

The majority of traditional corporate valuation methods are solely based on tangible indicators ' sales growth, gross profit, cash flow, and operational performance ' which are the result and don't reflect the underlying process which makes the organization successful in the longer run. One of the most important intangible assets that don't appear directly on the balance sheet is human capital. In this research, we focus on the role that workforce skills, education, and knowledge play towards organizational success using state-of-the-art Machine Learning techniques. We find that investments in human capital not only play a crucial role in organizational growth but also have a causal relationship.

LUNCH
12:20 pm - 1:30 pm
LUNCH
AI/ ML/ Data Science

Improving disaster response with machine learning
1:30 pm - 2:10 pm
By Antje Barth, Principal Developer Advocate, AI/ML at AWS & Chris Fregly, Principal Engineer, AI & ML at AWS
Improving disaster response with machine learning
Antje Barth
Chris Fregly
AWS
AWS
Principal Developer Advocate, AI/ML
Principal Engineer, AI & ML
Data 4 Good
SGM 123

The frequency and severity of natural disasters are increasing. In response, governments, businesses, nonprofits, and international organizations are placing more emphasis on disaster preparedness and response. Many organizations are accelerating their efforts to make their data publicly available for others to use. Repositories such as the Registry of Open Data on AWS and Humanitarian Data Exchange contain troves of data available for use by developers, data scientists, and machine learning practitioners. In this session, see how a community of developers came together though the AWS Disaster Response hackathon to build models to support natural disaster preparedness and response.

Data-Driven Optimization of plant health and water use
3:10 pm - 3:50 pm
By Graham Symmonds, Co-founder & Chief Executive Officer at Pluvial
Data-Driven Optimization of plant health and water use
Graham Symmonds
Pluvial
Co-founder & Chief Executive Officer
Data 4 Good
SGM 101

The world is on a collision course between increasing water volatility and the need to produce an increasing amount of food. Water is the cornerstone of agricultural products for used directly for food and to sustain livestock. Indeed, water is the most important and highest cost input for agricultural production. To meet this reality, Pluvial.AI has developed a platform that uses a combination of remote sensing, meteorological data and analytics to provide farmers with near real-time assessments of plant health while determining the optimum watering needs to maximize crop yield.

Tracking the Hidden Wealth of Russian Oligarchs
4:10 pm - 4:50 pm
By Mark Quinsland, Sr. Field Engineer at Neo4j
Tracking the Hidden Wealth of Russian Oligarchs
Mark Quinsland
Neo4j
Sr. Field Engineer
Data 4 Good
SAL 101

Luxury yachts, football teams, and mansions are no longer safe havens for the illicit profits of Russian Oligarchs with ties to Putin. Assets are being identified and seized with benefits flowing to causes in Ukraine. This presentation covers: - How are friends and relatives of Putin sheltering immense profits - Graphs and other tools being used to identify sources & destinations of illicit wealth - Latest asset seizures - New regulations to expose hidden investors

When is a housing bubble not a bubble? A look at problems and solutions to the housing crisis.
5:00 pm - 5:40 pm
By Preet Singh, CEO at Enginelitix
When is a housing bubble not a bubble? A look at problems and solutions to the housing crisis.

Los Angeles house rents have risen by two-thirds in the past decade, and property prices have risen by over 40% in the past three years alone, as part of a nationwide "bubble". Several mechanisms or solutions have been proposed or implemented with varying results. In this talk we look at if and when these mechanisms work, the side-effects, and why definitions are important.

Air Quality Analytic Center Framework (AQACF)
5:50 pm - 6:30 pm
By Jason Kang, Software Engineer at NASA-JPL
Air Quality Analytic Center Framework (AQACF)
Jason Kang
NASA-JPL
Software Engineer
Data 4 Good
THH 201

* Develop an Analytic Center Framework (ACF) for Air Quality in support of the NASA AIST air quality technology innovation effort, Air Quality Analytics Center Framework (AQACF) * Harmonize air quality data sets, models, and algorithms to facilitate analysis and projections of air quality across those sources. * Demonstrate analysis application area will focus on air pollution in large cities (e.g., Los Angeles) * Generalize framework to facilitate analyses for air quality applications more broadly Coordinate with other AIST-18 efforts including GEOS-Chem, GCHP, Predicting What We Breathe, etc.

NETWORKING SESSION
6:30 pm - 7:30 pm
ALUMNI PARK
NETWORKING SESSION
AI/ ML/ Data Science

Data Engineering
Data Engineering
Customer-Driven Data Engineering
10:00 am - 10:40 am
By Emad Georgy, CTO at Georgy Technology Leadership
Customer-Driven Data Engineering
Emad Georgy
Georgy Technology Leadership
CTO
Data Engineering
SAL 101

Getting customers engaged and excited about data architecture plans How to integrate UX practices into Data Engineering Data Governance is bullshit - why? Applying performance, scale and usability tests to your Data Engineering journey

Sponsored - Why Database Modernization Makes Your Data Decisions More Critical Than Ever
10:50 am - 11:30 am
By Anil Inamdar, VP & Head of Data Solutions at Instaclustr
Sponsored - Why Database Modernization Makes Your Data Decisions More Critical Than Ever
Anil Inamdar
Instaclustr
VP & Head of Data Solutions
Data Engineering
SGM 123

The most modernized enterprises utilize polyglot architecture, applying the best-suited database technologies to each of their organization's particular use cases. To successfully implement such an architecture, though, you need a thorough knowledge of the expansive NoSQL data technologies now available. Attendees of this Data Con LA presentation will come away with: -- A solid understanding of the decision-making process that should go into vetting NoSQL technologies and how to plan out their data modernization initiatives and migrations. -- They will learn the types of functionality that best match the strengths of NoSQL key-value stores, graph databases, columnar databases, document-type databases, time-series databases, and more. -- Attendees will also understand how to navigate database technology licensing concerns, and to recognize the types of vendors they'll encounter across the NoSQL ecosystem. This includes sniffing out open-core vendors that may advertise as “open source,"" but are driven by a business model that hinges on achieving proprietary lock-in. -- Attendees will also learn to determine if vendors offer open-code solutions that apply restrictive licensing, or if they support true open source technologies like Hadoop, Cassandra, Kafka, OpenSearch, Redis, Spark, and many more that offer total portability and true freedom of use.

Data Streaming with Kafka
11:40 am - 12:20 pm
By Jie Chen, Manager Advisory at KPMG
Data Streaming with Kafka
Jie Chen
KPMG
Manager Advisory
Data Engineering
GFS 106

Data is the new oil. However, many organizations have fragmented data in siloed line of businesses. In this topic, we will focus on identifying the legacy patterns and their limitations and introducing the new patterns packed by Kafka's core design ideas. The goal is to tirelessly pursue better solutions for organizations to overcome the bottleneck in data pipelines and modernize the digital assets for ready to scale their businesses. In summary, we will walk through three uses cases, recommend Dos and Donts, Take aways for Data Engineers, Data Scientist, Data architect in developing forefront data oriented skills.

LUNCH
12:20 pm - 1:30 pm
LUNCH
AI/ ML/ Data Science

Building a serverless data processing pipeline with PySpark on cloud
1:30 pm - 2:10 pm
By Suman Debnath, Principal Developer Advocate(Data Engineering) at Amazon Web Service
Building a serverless data processing pipeline with PySpark on cloud
Suman Debnath
Amazon Web Service
Principal Developer Advocate(Data Engineering)
Data Engineering
THH 201

Data is all over the place, and what matters is how we manage that data and make sense out of it and take some meaningful data driven decision. In this session we will discuss about whole data engineering pipeline, starting from data collection, processing, analysis and visualization in a complete serverless fashion. We will pick some opensource dataset and shall store and process it on cloud(AWS). While the focus would be more on the general understanding of data pipeline aspects of data engineering, but during the process we will learn few of the AWS services which can help us to achieve our goal in an effective and efficient way.

Sponsored - What's new with MongoDB 6.0 and Atlas
2:20 pm - 3:00 pm
By Sig Narvaez, Executive Solution Architect at MongoDB
Sponsored - What's new with MongoDB 6.0 and Atlas
Sig Narvaez
MongoDB
Executive Solution Architect
Data Engineering
SGM 123

MongoDB is now a Developer Data Platform. Come learn what�s new in the 6.0 release and Atlas following all the recent announcements made at MongoDB World 2022. Topics will include - Atlas Search which combines 3 systems into one (database, search engine, and sync mechanisms) letting you focus on your product's differentiation. - Atlas Data Federation to seamlessly query, transform, and aggregate data from one or more MongoDB Atlas databases, Atlas Data Lake and AWS S3 buckets - Queryable Encryption lets you run expressive queries on fully randomized encrypted data to meet the most stringent security requirements - Relational Migrator which analyzes your existing relational schemas and helps you design a new MongoDB schema. - And more!

Key Open Source Databases Strategies that Shape Business in 2022
3:10 pm - 3:50 pm
By Peter Zaitsev, CEO & Co-founder at Percona
Key Open Source Databases Strategies that Shape Business in 2022
Peter Zaitsev
Percona
CEO & Co-founder
Data Engineering
THH 301

It has been an exciting year in the open source database industry, with more choice, more cloud, and key changes in the industry. We will dive into the key developments over 2022, including the most important opensource database software releases in general, the significance of cloud-native solutions in a multi-vendor, multi-cloud world, the new criticality of security challenges, and the evolution of the open source software industry.

Modernizing Analytics & AI for today's needs: Intuit TurboTax Case Study
4:10 pm - 4:50 pm
By Ravi Pillala, Chief Data Architect & Distinguished Engineer at Intuit
Modernizing Analytics & AI for today's needs: Intuit TurboTax Case Study
Ravi Pillala
Intuit
Chief Data Architect & Distinguished Engineer
Data Engineering
SGM 123

TurboTax is one of the well known consumer software brand which at its peak serves 385K+ concurrent users. In this session, We start with looking at how user behavioral data & tax domain events are captured in real time using the event bus and analyzed to drive real time personalization with various TurboTax data pipelines. We will also look at solutions performing analytics which make use of these events, with the help of Kafka, Apache Flink, Apache Beam, Spark, Amazon S3, Amazon EMR, Redshift, Athena and Amazon lambda functions. Finally, we look at how SageMaker is used to create the TurboTax model to predict if a customer is at risk or needs help.

MySQL, JSON & You: Perfect Together
5:00 pm - 5:40 pm
By Scott Stroz, MySQL Developer Advocate at Oracle
MySQL, JSON & You: Perfect Together
Scott Stroz
Oracle
MySQL Developer Advocate
Data Engineering
SGM 101

in this session we discuss storing JSON in our database and how the tools in MySQL can make that task a bit easier. We will start with a discussion of why you might want to store data as JSON as opposed to other storage methods. This discussion will include some real world examples. We will then talk about the different ways in which MySQL can help us achieve that goal and how we might go about deciding what method is best. We will see examples of using the JSON data type to store data in a 'normal' table and use SQL commands to retrieve, filter and sort that data. We will also show how you can validate your JSON schema. Finally, we will talk about MySQL Document Store which allows developers to tap into the best features of a NOSQL database as well as a traditional SQL based database. Attendees of this session will have the following takeaways: - An understanding of when it might be a better alternative to store data as JSON - How to create a JSON column in a database and provide optional validation for the JSON schema. - How to store and retrieve JSON data, as well as filter and sort results based on data in our JSON object - How to set up MySQL Document Store and configure collections - How to store and retrieve documents, as well as filter and sort documents returned from MySQL Document Store

Supercharge your Snowflake Data Cloud from a Snowflake Datasuphero. Cost, Security, and Performance Optimization Best Practices and Tips
5:50 pm - 6:30 pm
By Frank Bell, Data Thought Leader and Snowflake SME at Accenture - CEO at ITS
Supercharge your Snowflake Data Cloud from a Snowflake Datasuphero. Cost, Security, and Performance Optimization Best Practices and Tips
Frank Bell
Accenture
Data Thought Leader and Snowflake SME
Data Engineering
SAL 101

We will cover all aspects of optimizing your Snowflake Data Cloud including: *Dive deep into how Snowflake pay as you go costs work and how by utilizing our proven optimization tools - Snoptimizer SaaS Snowflake Optimizer - https://snoptimizer.com/ , scripts, and architecture techniques you typically can save 10-40++% on your existing Snowflake Account costs. *Explain how Snowflake Compute works and proven techniques on how to architect warehouses for both cost and performance efficiency. We cover in depth how snowflake scales BOTH out and in as well as up and down with compute resources. *Explain how Snowflake data storage works with Replication, Time-Travel, and Cloning. We explain these awesome features as well as their downsides if they are used and configured wrongly. *Cover Snowflake cloud services costs and features that have costs related to them, including Snowpipe, Search Optimization, Materialized Views, Auto-clustering, and other recent new cost based features that provide value at a cost. *Finally, we will discuss how you can ensure your Snowflake Account(s) are fully optimized not just for cost but also for security and performance on Snowflake. We will show you security and performance best practices as well as pitfalls to avoid.

NETWORKING SESSION
6:30 pm - 7:30 pm
ALUMNI PARK
NETWORKING SESSION
AI/ ML/ Data Science

Data Infrastructure & Security
Data Infrastructure & Security
Building Field-level Lineage from Scratch for Modern Data Systems
10:00 am - 10:40 am
By Xuanzi Han, Senior Software Engineer at Monte Carlo
Building Field-level Lineage from Scratch for Modern Data Systems
Xuanzi Han
Monte Carlo
Senior Software Engineer
Data Infrastructure & Security

For modern data teams, lineage is a critical component of the data pipeline root cause and impact analysis workflow, as well as a means of ensuring that data, models, and other data assets are healthy and reliable. That being said, the complexity of SQL queries can make it challenging to build lineage manually, particularly at the field level. Xuanzi Han, a member of Monte Carlo's data and product teams, tackled this challenge head-on by leveraging some of the most popular tools in the modern data stack, including dbt, Airflow, Snowflake, and ANother Tool for Language Recognition (ANTLR). In this talk, they share how they designed the data model, query parser, and larger database design for field-level lineage, highlighting learnings, wrong turns, and best practices developed along the way.

Building Field-level Lineage from Scratch for Modern Data Systems
10:00 am - 10:40 am
By Xuanzi Han, Senior Software Engineer at Monte Carlo
Building Field-level Lineage from Scratch for Modern Data Systems
Xuanzi Han
Monte Carlo
Senior Software Engineer
Data Infrastructure & Security
GFS 106

For modern data teams, lineage is a critical component of the data pipeline root cause and impact analysis workflow, as well as a means of ensuring that data, models, and other data assets are healthy and reliable. That being said, the complexity of SQL queries can make it challenging to build lineage manually, particularly at the field level. Xuanzi Han, a member of Monte Carlo's data and product teams, tackled this challenge head-on by leveraging some of the most popular tools in the modern data stack, including dbt, Airflow, Snowflake, and ANother Tool for Language Recognition (ANTLR). In this talk, they share how they designed the data model, query parser, and larger database design for field-level lineage, highlighting learnings, wrong turns, and best practices developed along the way.

Modern Data Architecture
10:50 am - 11:30 am
By Sonny Khan, Senior Data Analytics Specialist at AWS
Modern Data Architecture
Sonny Khan
AWS
Senior Data Analytics Specialist
Data Infrastructure & Security
THH 301

As everyone’s data is growing, there is a growing need to become data-driven and harness decision-making from data. The existing and legacy approaches created silos, dark data, and risk. AWS Modern Data Architecture is alleviating these issues and enabling new capabilities through cost effective and purpose-built services and architecture. Learn how it works and how customers have used it to realize their business goals and outcomes.

Customer Lifetime Value's review of a lifetime!
11:40 am - 12:20 pm
By Tatiana Tikhonova, Data Scientist at Sense360/Medallia
Customer Lifetime Value's review of a lifetime!
Tatiana Tikhonova
Sense360/Medallia
Data Scientist
Data Infrastructure & Security
THH 201

CLTV is like a mythical creature in the world of business and data — everyone talks about it, some may even get a brief glimpse of it, but no one knows how to approach it, let alone dare to tame it. In this short but informative talk, you will be offered an opportunity to learn about the caveats and assumption of a metric that is on everyone's mind, while being exposed to seasoned frameworks and fresh perspectives. You will walk the well known road laid by the industry standard methods as well as unconventional paths hiding deep in the Data Science forests, with your trusted guide being Tatiana Tikhonova, a Lead Data Professional at Medallia.

LUNCH
12:20 pm - 1:30 pm
LUNCH
AI/ ML/ Data Science

Unlock the full potential of Data Platforms through Automation
1:30 pm - 2:10 pm
By Jagjit Dhaliwal, VP, Global CIO Industry Leader at UiPath
Unlock the full potential of Data Platforms through Automation
Jagjit Dhaliwal
UiPath
VP, Global CIO Industry Leader
AI/ ML/ Data Science
THH 301

Unlock the full potential of analytics platforms and AI/ML capabilities using automation. The hyper-automation platforms can help extend the capabilities to the sources and systems which are unreachable otherwise through regular integrations. Following are the phases of the data life cycle in which automation can assist. 1. Gather data from challenging sources: Use automation to import data from websites, legacy applications and challenging documents 2. Structure and consolidate: Extend data preparation capabilities by structuring and aggregating data from legacy sources. 3. Take action where you make decisions: Take actions based on your insights. Trigger automations from within BI and dashboard platforms. 4. Rapidly integrate with LOB systems: Connect disparate LOB systems like SAP, Oracle, Salesforce and many more. 5. Seamlessly extend workflows with the automation processes: Extend existing workflows and automations by incorporating additional sources, triggers and outputs.

Sponsored - Understand and Discover Data in your Organization
2:20 pm - 3:00 pm
By Garnie Bolling, Senior Solutions Architect at Boomi
Sponsored - Understand and Discover Data in your Organization
Garnie Bolling
Boomi
Senior Solutions Architect
AI/ ML/ Data Science
THH 201

Data proliferation is growing as enterprises progress on their digital journey. Over 60% of an organization’s data is unknown, dormant, or underutilized. Imagine if you had the right data in the right context at the right time? What if all your data was connected, integrated, and truthful? Picture frictionless migrations, productive users and happy stakeholders. Boomi Enterprise Data Catalog and Preparation can take you there. In this hands on session, you will learn: - Learn why a Data Catalog is so important to a successful Data Governance strategy - See how to use Natural Language to Search and Discover data, using AI and ML to understand what data was cataloged - How data stewards can Collaborate, Contribute and Enrich the catalog by adding Context to the data - Develop Data Dictionaries and Business Glossaries to define data sets within the catalog - How it automatically Identifies and masks sensitive data - How ML uses the catalog to derive new data sets and load them directly into BI and Data Warehouse/Lake Tools to get the data science team the data they need more quickly

Moving Data at Scale to AWS
3:10 pm - 3:50 pm
By George Mansoor, Chief Information Systems Officer at California State University
Moving Data at Scale to AWS
George Mansoor
California State University
Chief Information Systems Officer
Data Infrastructure & Security
SGM 123

Overview of the CSU Data Architecture on moving on-prem ERP data to the AWS Cloud at scale using Delphix for Data Replication/Virtualization and AWS Data Migration Service (DMS) for data extracts

Open Source or Open Core in Your Data Layer? What Needs to Be Evaluated Before Diving In.
4:10 pm - 4:50 pm
By Anil Inamdar, VP & Head of Data Solutions at Instaclustr
Open Source or Open Core in Your Data Layer? What Needs to Be Evaluated Before Diving In.
Anil Inamdar
Instaclustr
VP & Head of Data Solutions
Data Infrastructure & Security
SGM 101

Most organizations considering open source and open core cloud technologies as part of their all-important data stack understand they need to rigorously evaluate the software's licensing terms and gauge the long-term health of its community and ecosystem. What still happens less frequently ' but is just as crucial to these risk assessments ' is developing a thorough understanding of the business models governing the commercial organizations attached to each data-layer technology being considered. You must discern the underlying motivations of the vendors or technology providers you depend on to deliver or support open source data-layer software (as well as those vendors with strong influence over its development and maintenance). By acutely understanding these incentives, you can identify if, where, and how they may map to possible risks to your enterprise's adoption and ongoing open source implementation. Don't limit the assessment to licenses and community health -- although both are still very key variables. This session will discuss specifics on what you need to look for and consider when vetting open source data technologies in the cloud as offered by: -- Businesses using OSS as the foundation of their own intellectual property -- Businesses that maintain total control offer the OSS they offer -- Major cloud providers

The Evolution of AI in Cybersecurity
5:00 pm - 5:40 pm
By Michael Melore, Senior Cybersecurity Advisor at IBM
The Evolution of AI in Cybersecurity
Michael Melore
IBM
Senior Cybersecurity Advisor
Data Infrastructure & Security
SAL 101

The session will include views from the panel (and myself) * Review the current challenges, volumes of events, staffing shortages, expertise deficiencies, siloed security controls, * Provide statistics from recent Ponemon Institute reports including the recent Cost of a Data Breach 2021 Report's findings in attack vectors, response/organizational impact and costs attributed to remote workforces, * Provide The impact in cost and response times of AI/Machine Learning etc. * Share the way's AI is used in law enforcement and critical infrastructure protection, * Discuss AI bias and evolving Trust and Validation requirements in AI systems, the necessity and value of AI insight to security and where the industry is moving in AI for security.

Why Data Quality vigilance requires an End-to-End, Automated and Continuous Data Integrity Approach
5:50 pm - 6:30 pm
By Curtis ODell, Global Director Data Integrity at Tricentis
Why Data Quality vigilance requires an End-to-End, Automated and Continuous Data Integrity Approach
Curtis ODell
Tricentis
Global Director Data Integrity
Data Infrastructure & Security
GFS 106

Join me to learn about a new end-to-end data testing approach designed for modern data pipelines that fills dangerous gaps left by traditional data management tools—one designed to handle structured and unstructured data from any source. You'll hear how you can use unique automation technology to reach up to 90 percent test coverage rates and deliver trustworthy analytical and operational data at scale. Several real world use cases from major banks/finance, insurance, health analytics, and Snowflake examples will be presented. Key Learning Objective 1. Data journeys are complex and you have to ensure integrity of the data end to end across this journey from source to end reporting for compliance 2. Data Management tools do not test data, they profile and monitor at best, and leave serious gaps in your data testing coverage 3. Automation with integration to DevOps and DataOps' CI/CD processes are key to solving this. 4. How this approach has impact in your vertical

NETWORKING SESSION
6:30 pm - 7:30 pm
ALUMNI PARK
NETWORKING SESSION
AI/ ML/ Data Science

Emerging Tech
Emerging Tech
Block Chain for master data management
10:00 am - 10:40 am
By Chuck Chekuri, Co-Founder & CDO at togetherMDM
Block Chain for master data management
Chuck Chekuri
togetherMDM
Co-Founder & CDO
Emerging Tech
THH 301

A system for a single source of truth has been the holy grail of many Master data management solutions and is a critical pillar of many data governance models. Existing technology solutions and the myriad of tools fall short in many ways. The main challenge seems to be portions of the truth are embedded in applications, and the rules of the business govern the changes to this master data. The option of establishing a single source of truth outside the applications creates the challenge of managing and coordinating the change across many systems. Most rules to coordinate the change cannot be programmed into existing systems and have to be enforced manually. Existing tooling in this area provides connectors to application sources but cannot resolve conflicting changes to master data. Human intervention is needed. Most implementations of blockchain as implemented today are slow, cannot scale to support millions of transactions in a day and can be very slow and expensive. However, private Ethereum networks with custom “smart contracts” can potentially eliminate human intervention to resolve conflicts and improve change management. Furthermore, blockchain, with its immutable and distributed properties, can work across organizations as it can also serve as a data exchange mechanism. - Existing solutions for single source of truth have insurmountable challenges. - Existing Data Governance processes are manual, costly and error prone. - Immutability and distributed properties of blockchain make it a natural fit. - Smart Contracts can be leveraged to automate Data Governance processes. - Integrating Block chain with existing data governance process is the key.

Collaborative Data Exploration using Conversational AI
11:40 am - 12:20 pm
By Anand Ranganathan, Chief AI Officer at Unscrambl
Collaborative Data Exploration using Conversational AI
Anand Ranganathan
Unscrambl
Chief AI Officer
Emerging Tech
SGM 123

Conversational AI is getting more and more widely used for customer support and employee support use-cases. In this session, I'm going to talk about how it can be extended for data analysis and data science use-cases ... i.e., how users can interact with a bot to ask analytical questions on data in relational databases. This allows users to explore complex datasets using a combination of text and voice questions, in natural language, and then get back results in a combination of natural language and visualizations. Furthermore, it allows collaborative exploration of data by a group of users in a channel in platforms like Microsoft Teams, Slack or Google Chat. For example, a group of users in a channel can ask questions to a bot in plain English like ""How many cases of Covid were there in the last 2 months by state and gender"" or ""Why did the number of deaths from Covid increase in May 2022"", and jointly look at the results that come back. This facilitates data awareness, data-driven collaboration and joint decision making among teams in enterprises and outside. In this talk, I'll describe how we can bring together various features including natural-language understanding, NL-to-SQL translation, dialog management, data story-telling, semantic modeling of data and augmented analytics to facilitate collaborate exploration of data using conversational AI.

LUNCH
12:20 pm - 1:30 pm
LUNCH
AI/ ML/ Data Science

CyberSecurity in Industrial IoT age
1:30 pm - 2:10 pm
By Jauher Zaidi, Founder, Chairman & CINO at Palmchip
CyberSecurity in Industrial IoT age
Jauher Zaidi
Palmchip
Founder, Chairman & CINO
Emerging Tech
SGM 101

Despite being the third most targeted industry for a cyberattack, manufacturing is one of the least prepared. Colonial Pipeline and meat plants were in the news but hundreds others are being attacked each day. Manufacturing is becoming increasingly digitised as the industry is adopting automation, IoT for monitor and control, to a greater extent than ever before. The Industrial Internet of Things (IIoT) is bringing artificial intelligence, cloud computing and robotics into factories. Cyber-physical systems can now integrate all aspects of the supply chain, including operational systems and information systems, and are taking the place of outdated, siloed machines. Any factory making use of these new technologies is known as a Smart Factory, and they're prompting what experts are calling the fourth industrial revolution, or Industry 4.0. Smart Factories will help the manufacturing industry considerably, as digital technology can offer greater efficiency in the production stage, better quality products with fewer mistakes, and more flexibility for working processes. So it's no wonder that manufacturers are moving quickly to update their factories by 2019, 75% of large manufacturers will have incorporated the Industrial IoT in their operations. And by 2022, the Industrial IoT market is expected to be worth $195.47 billion. I will discuss; * The Smart Factory & IoT / OT * Manufacturers are investing in digital * What cybersecurity risks do manufacturers face? * How can the manufacturing industry mitigate risk? * Three key questions every manufacturer should ask themselves before implementing IoT.

Event Sourcing with Apache Pulsar and Apache Quarkus
3:10 pm - 3:50 pm
By David Kjerrumgaard, Developer Advocate at StreamNative
Event Sourcing with Apache Pulsar and Apache Quarkus
David Kjerrumgaard
StreamNative
Developer Advocate
Emerging Tech
SAL 101

I believe that event-sourcing is the best way to implement persistence within a microservices architecture, but it hasn't always been the easiest solution to implement. In this talk, I will demonstrate how these two exciting technologies can be combined into one killer stack that simplifies event sourcing development. I will outline how to use DDD and CQRS concepts as a guide for developing an event sourcing food-delivery application based on Apache Pulsar and Quarkus that is 100% cloud native. Throughout this talk, I will demonstrate several different event sourcing design patterns across multiple microservices to feed multiple real-time dashboards that provide driver location tracking, and heatmaps. I will also highlight some patterns for using an event streaming platform as your event store.

How are NFTs and DeFi Changing Entertainment
4:10 pm - 4:50 pm
By Mariana Danilovic, Managing Director at Infiom, LLC
How are NFTs and DeFi Changing Entertainment
Mariana Danilovic
Infiom, LLC
Managing Director
Emerging Tech
GFS 106

We will address: (1) Community creation and engagement using tokens and NFTs (2) Organization of DAO structures and ways to incentivize Web3 communities (3) DeFi business models applied to Web3 ventures (4) Why Metaverse matters for new entertainment and community engagement models.

PaCMAP ensembles for occupational specializations in the California Cloud Workforce
5:00 pm - 5:40 pm
By Neal Fultz, Principal Consultant at njnm Consulting
PaCMAP ensembles for occupational specializations in the California Cloud Workforce
Neal Fultz
njnm Consulting
Principal Consultant
Emerging Tech
THH 201

The California Cloud Workforce is an initiative in LA-area community colleges to develop the skills for future employment in Cloud and DevOps roles, spearheaded by Santa Monica College. Because there are more than 20 colleges participating and because the technology and required skills evolves rapidly, we have developed an NLP ensemble using federal data to identify occupational specializations in Cloud Computing, and the relevant coursework across many different institutions. Training and using the model consisted of several phases: * Extracting occupational data from the O*NET system and curricula data from Course Outlines of Record * Creating component models using DistilBERT, traditional NLP topic models, and the Bloom taxonomy of educational objectives * Ensembling the component models using PaCMAP * Deployment, and aggregating and visualizing results Using PaCMAP and DistilBERT produced a more parsimonious model that can leverage both transformers architecture and domain specific knowledge, and can be calibrated for Cloud Computing or other programs, and is so easy to manage that students deploy it themselves as part of their coursework.

Practical Solutions to Complex Supply Chain Problems
5:50 pm - 6:30 pm
By Raz Nistor, Sr. Director at Keyrus US
Practical Solutions to Complex Supply Chain Problems
Raz Nistor
Keyrus US
Sr. Director
Emerging Tech
THH 301

The Manufacturing and Consumer Packaged Goods industries are still facing unprecedented negative pressures on their operating profits from ongoing supply chain disruptions. Many of these organizations have turned to AI to solve all their problems. Surprisingly (/sarcasm), many of these same organizations have failed to realize and sustain ROI from their investments in AI. In this discussion, we will review a few solutions to supply chain initiatives we've rolled out and highlight the keys to success that we've identified to help organizations sustain value from applications in AI. In particular, we will discuss end-to-end technical solutions in demand forecasting, machine learning for lead-time optimization, and full network optimization, as well as the governance and organizational structures we have to consider to make sure the value from these solutions scales throughout the organization.

NETWORKING SESSION
6:30 pm - 7:30 pm
ALUMNI PARK
NETWORKING SESSION
AI/ ML/ Data Science

Pre-Recorded Videos (Videos available August 11th)

6 am
7 am
8 am
9 am
10 am
11 am
12 pm
1 pm
AI/ ML/ Data Science
AI/ ML/ Data Science
Integrating data science initiatives in business strategy for small businesses
6:00 am - 7:00 am
By Linda Liu, Head of Data Analytics and Data Science at Hyrecar
Integrating data science initiatives in business strategy for small businesses
Linda Liu
Hyrecar
Head of Data Analytics and Data Science
AI/ ML/ Data Science

Machine learning and artificial intelligence are becoming mainstays of the enterprise business world, but many entrepreneurs and small-business owners still shy away from investing in it. You might not think that small businesses need to use AI in the same way that large companies do, but that is a fallacy. AI can contribute to many facets of business success including data sorting, email marketing, and much more. In small companies, people think there is not enough data volume for AI applications and automatically dismiss it. Stop! AI can still be applied. The core thing is to integrate AI in a cohesive plan outlining the processes for collecting, processing, governing, and eliciting value from data, optimized for ML and data science. This hold true for small companies as well. How we frame our approach to AI initiatives will determine its success. Don't worry, I am not a zealot. I will not tell you AI and ML are the cure-all and will solve all your problems. Some tasks are particularly well suited to these techniques, but not all. What I love about them is the fact that they allow us to tackle difficult problems that might otherwise be too daunting.

Simplifying AI/ML using Databricks featurestore
7:00 am - 8:00 am
By Debu Sinha, Sr Specialist Solutions Architect - AI/ML at Databricks
Simplifying AI/ML using Databricks featurestore
Debu Sinha
Databricks
Sr Specialist Solutions Architect - AI/ML
AI/ ML/ Data Science

1. What are feature stores. 2. Why are they important? 3. Using Databricks and the feature store offering to streamline ml. This hold true for small companies as well. How we frame our approach to AI initiatives will determine its success. Don't worry, I am not a zealot. I will not tell you AI and ML are the cure-all and will solve all your problems. Some tasks are particularly well suited to these techniques, but not all. What I love about them is the fact that they allow us to tackle difficult problems that might otherwise be too daunting.

BI/ Reporting/ Business Use Cases
BI/ Reporting/ Business Use Cases
How to Get into Data Analytics in 2022-2023
6:00 am - 7:00 am
By Christian Bourdeau, Analytics Manager at Activision Blizzard
How to Get into Data Analytics in 2022-2023
Christian Bourdeau
Activision Blizzard
Analytics Manager
BI/ Reporting/ Business Use Cases

Showcase how I pivoted from professional concert photography to data analytics at Blue Chip companies. I will show you the classes I took, the skills I developed, and how you can do the same in 2022 and 2023

Use Cases Unlocked by Business Intelligence Alerts
7:00 am - 8:00 am
By Bob Newstadt, Senior Director of Business Intelligence at HopSkipDrive
Use Cases Unlocked by Business Intelligence Alerts
Bob Newstadt
HopSkipDrive
Senior Director of Business Intelligence
BI/ Reporting/ Business Use Cases

Dashboards are fast to build and user-friendly due to great tools such as Tableau. But there are use cases where dashboards are a poor fit. A year ago the operations team at HopSkipDrive demanded a better solution to replace a suite of existing dashboards. Lacking suitable features in existing tools we built a system that sends real-time alerts to Slack channels. The initial project was a success. What happened over the next few months surprised us: users found 30 additional use cases for real-time alerts. Those alerts have become an integral part of the service improving safety and user experience on the platform. In addition, alert volume is used as a measure of defects, a component of our overall service level score. This talk will deconstruct what we learned. • When are dashboards a poor fit? • What use cases are well-served by real-time alerts? • How was the alert service built? • Using alert volume as a defect score.

Data 4 Good
Data 4 Good
Data Science in water utility industry
6:00 am - 7:00 am
By Jay Kim, Data Scientist at Golden State Water Company
Data Science in water utility industry
Jay Kim
Golden State Water Company
Data Scientist
Data 4 Good

utilize SCADA data and monitoring in real time Water utilities have real time SCADA data in all the water plants and facilities. It is challenging to get the large amount SCADA data and make use of them. * forecasting chemical concentrations in water in advance For water quality, water engineers should know how chemicals concentrations in water are changing. By knowing how the concentrations are changing, they can make right decisions on what to do to make water quality good by optimizing their operations of water quality. For example, by knowing chlorine, nitrite concentrations in advance they can decide when to flush the water pipelines again next. By doing flushing in right time just right amount, they can save costs and labors as well as efficiently operating water quality. * optimize asset spending allocations to spend using machine learning throughout given years. in regulated water utilities, budgets are decided every 3 years by investors. Given budgets and assets should be well allocated to spend so that the given budgets and assets are all well used when the 3 years end. It should not be spent too little or too much. By using machine learning, the optimal asset spending allocation is efficiently decided. Based on the information, project managers and general managers can spend right amount budgets in each month and hit the target in earning test when the 3 years end. * forecasting water supply is for predicting how much water should be purchased for shortage of water or drought. In Southern California, it is very important to forecast how much water will be supplied by producing the water because oftentimes, drought comes. If too little water is supplied, water utilities company purchase the shortage from different areas' water utilities companies. To set up the budget and purchase right amount of water, it is very important to forecast water supply by water production of the water utilities company. By applying machine learning, the water supply can be forecasted. * by forecasting water depth change in wells, engineers can know which wells will dry out by drought. By applying machine learning, the water depth of the wells are forecasted and the wells that will be dry are identified.

Helping California Tackle the COVID-19 Pandemic using Data Analytics
7:00 am - 8:00 am
By Steve Rimar, Founder at Analytica Consulting, LLC
Helping California Tackle the COVID-19 Pandemic using Data Analytics
Steve Rimar
Analytica Consulting, LLC
Founder
Data 4 Good

This talk focuses on our story of how we helped the government of California manage the emergency response to the COVDI-19 pandemic over the past few years. We'll go over how we became part of California's COVID-19 data task force and the challenges that were overcome to provide real-time and accurate COVID-19 statistics for California. Please see http://covid19.ca.gov for more information.

Hispanic Demographics basing on most recent 2020 Decennial Survey: by states and counties
8:00 am - 9:00 am
By Joe Long, President at Summit Capital _California
Hispanic Demographics basing on most recent 2020 Decennial Survey: by states and counties
Joe Long
Summit Capital _California
President
Data 4 Good

Data analysis presentation using Tidycensus on R platform to analyze trends on major race and ethnic demographics, data from U.S. Census decennial data.

How Transparent AI Will Enable More Equitable Products
9:00 am - 10:00 am
By Dipanwita Das, CEO & Co-Founder at Sorcero
How Transparent AI Will Enable More Equitable Products
Dipanwita Das
Sorcero
CEO & Co-Founder
Data 4 Good

70% of AI leaders cannot explain how specific AI model decisions or predictions are made, and only 35% said their organization made an effort to use AI in a way that was transparent and accountable. Ethics and AI have become a central conversation in the tech industry, driven by the lack of understanding of data models, what information they are trained on, and the risk of bias. This is especially critical in sectors like healthcare, where algorithmic bias can leave out significant portions of a population and lead to devastating results. Attendees of this session will: - Understand the inherent challenges in utilizing artificial intelligence in a way that is transparent and accountable. - Why you need to pay more attention to your data models and how you are are using & training AI and deep learning. - How to develop a framework for identifying and overcoming inherent biases in data sets to ensure that your AI is driving more equitable products.

Humans Are Hard, Code is Easy
10:00 am - 11:00 am
By Tom Henricksen, Humans Skill Enabler at Code is Easy
Humans Are Hard, Code is Easy
Tom Henricksen
Code is Easy
Humans Skill Enabler
Data 4 Good

Are you a frustrated developer who feels like they know enough? However, the success you thought you would have is out of reach. You see others who make better strides but why? Is it a skills gap? Come learn how to set yourself apart as a developer and learn the skills of influence and collaboration.

NFTS 4 Good
11:00 am - 12:00 pm
By Venus Banuelos, CEO at IMO
NFTS 4 Good

Using NFT project's to fund mutual aid humanitarian project's. Why NFTS How To use Community to Change the world Building A Bright Tomorrow, Together Shaping The a future With Web3 Environmental Impact - Why Polygon

Unleashing the Power of AI for Education: Using Deep Learning to Better Help Teachers in K-12 Online Education
12:00 pm - 1:00 pm
By Zitao Liu, Head of Engineering, ThinkAcademy International at TAL Education Group
Unleashing the Power of AI for Education: Using Deep Learning to Better Help Teachers in K-12 Online Education
Zitao Liu
TAL Education Group
Head of Engineering, ThinkAcademy International
Data 4 Good

There is a pressing need to investigate approaches of how AI can be leveraged to support teachers in online education. In this talk, I will discuss some recent developments of applying deep learning approaches in AI in education (AIED) field for teachers in K-12 online education. More specifically, I will talk about (1) representation learning in real-world educational contexts; and (2) improving teaching effectiveness via advanced AIED tools. Participants will learn about recent trends and emerging challenges in this topic, representative tools and learning resources to obtain ready-to-use models, and how related models and techniques benefit real-world AIED applications.

Data Engineering
Data Engineering
Hamilton -- a General Purpose Microframework for Scalable Feature Engineering
6:00 am - 7:00 am
By Elijah ben Izzy, Data Platform Engineer at StitchFix
Hamilton -- a General Purpose Microframework for Scalable Feature Engineering
Elijah ben Izzy
StitchFix
Data Platform Engineer
Data Engineering

At Stitch Fix, data is integral to every facet of our business. We run a plethora of dataflows to transform raw data into features that models use to serve customers. We need to scale these dataflows both in code complexity, as we add new capabilities, and in data size, as we attain more customers. To ensure that these workflows don't devolve into an unmaintainable mess of spaghetti code (chalk full of in-place pandas operations), we built and open-sourced hamilton, a pluggable microframework to make scaling and managing complex dataflows easy. To use hamilton, one creates a dataflow by writing simple python functions in a declarative manner. The framework stitches them together, introducing an abstraction to configure and execute these dataflows. In this talk, we'll present the basic concepts of hamilton, discuss its impact at Stitch Fix, and share recent extensions to the project, including integrations with Dask, Spark, and Ray and why we're excited for its future.

OpenSearch: (Just About) Everything You Need to Know About Its Architecture
7:00 am - 8:00 am
By Seth Muthukaruppan, Consultant at Instacluster
OpenSearch: (Just About) Everything You Need to Know About Its Architecture
Seth Muthukaruppan
Instacluster
Consultant
Data Engineering

OpenSearch is an incredibly powerful search engine and analytics suite for ingesting, searching, visualizing, and analyzing your data and it is fully open source. This Apache 2.0-licensed and community-driven collection of technologies harnesses an architecture that combines the powers of Elasticsearch 7.10.2, Kibana 7.10.2 and Apache Lucene. With OpenSearch, users gain a distributed framework featuring particularly powerful scalability, high availability, and database-like capabilities. Attendees at this DataCon LA presentation will come away understanding OpenSearch's architecture and its building-block technology components, including: -- Apache Lucene utilization. Learn how this high-performance Java-based search library utilizes Lucene's inverted search index to delivers incredibly fast search results (while supporting natural language, wildcard, fuzzy, and proximity searches). -- OpenSearch cluster architecture. An OpenSearch cluster is a distributed and horizontally-scalable collection of nodes, which are differentiated based on the operations they perform. Attendees will learn the specific functions of master, master-eligible, data, client, ingest nodes. -- Data organization. Understand how OpenSearch organizes data into indices (which contain documents, which contain fields). -- Internal data structures. Get an in-depth look at how OpenSearch achieves scalability and reliability by breaking up indices into shards and segments, and utilizes translogs. -- Aggregations. See how OpenSearch enables its advanced built-in analytics capabilities through the power of aggregations.

Data Infrastructure & Security
Data Infrastructure & Security
Effective IAM: How Auto OTP Provides the Foundation for Zero Trust Security & Convenience
6:00 am - 7:00 am
By Don Malloy, Chair JCC at OATH
Effective IAM: How Auto OTP Provides the Foundation for Zero Trust Security & Convenience
Don Molloy
OATH
Chair JCC
Data Infrastructure & Security

This presentation will discuss how breaches happen, the growth of these hacks and the cost incurred by various aspects of industry. These breaches continue to happen even with elaborate safeguards in place. The Banking/Financial Industry has sophisticated authentication systems, and yet the move toward Omni-Channel Banking remains elusive. It can more commonly referred to as broken. Knowing who you are connecting with is critical in this world...We have many methods - Biometrics, SMS, PKI Certificates, behavioral but mutual authentication is best by transferring the burden on the service and not the customer. We have Auto OTP: 1. Authentication to online service 2. User Authentication 3. User password auto updating Multi-Channel authentication has been fragmented in the past and moving to an Omni-Channel authentication provides security to the user that their PII data has not been compromised. The attendee will learn how to implement such a system that will provide the Zero Trust security so often needed throughout industry, education and Government.

Multi Cloud Infrastructure made easy
7:00 am - 8:00 am
By Hugo Bergmann, Product Marketing Manager Lyve Cloud and Data Services, Chair JCC at Seagate
Multi Cloud Infrastructure made easy
Hugo Bergmann
Seagate
Product Marketing Manager Lyve Cloud and Data Services
Data Infrastructure & Security

Learn how to start building a thriving multicloud with Lyve Cloud. “I need a cloud solution that offers greater data protection and flexibility and that reduces TCO.” If this sounds familiar, you’re not alone. Many organizations today face the same demands, and they’re turning to multicloud as the solution. In this session, we’ll cover the benefits of such an approach and show how Lyve Cloud can integrate into your public cloud environment for a seamless multicloud experience.

Emerging Tech
Emerging Tech
Quantum Computing: The next new technology in computing
6:00 am - 7:00 am
By Mark Jackson, Quantum Evangelist at Quantinuum
Quantum Computing: The next new technology in computing
Mark Jackson
Quantinuum
Quantum Evangelist
Emerging Tech

Quantum computing is rapidly becoming commercially feasible. Many tech giants - Google, IBM, Honeywell, and Microsoft - are spending billions to far outpace Moore's Law. Last year achieved the major milestone of Quantum Supremacy where it was shown that a quantum computer could greatly outperform a classical computer. Quantum computing offers the promise of solving problems which would be impossible for a classical computer including optimization, anomaly detection, and material design. It also allows unhackable communication. In this presentation I will summarize what quantum computing is and why it is so important. I will sketch the landscape of the field including the hardware, software, and major customers at present. The tool most critical for data analysis - quantum machine learning - will be explained, along with the type of applications it is best suited for. Finally I will explain how you can take the first steps into leveraging quantum computing for your enterprise's benefit. -What is quantum computing -Who are the major players in the field -What is quantum machine learning and what types of problems can it address -How your company can take advantage of this

Web3 and Decentralized Identity
7:00 am - 8:00 am
By Harrison Tang, CEO at Spokeo
Web3 and Decentralized Identity
Harrison Tang
Spokeo
CEO
Emerging Tech

Digital identity is who you are in the digital world, and you should be able to control your own identity. In a world where Big Tech controls data on millions of its users, how do we claim our digital identities? Self-sovereign identity gives people the control over their identities by empowering them as intermediaries of identity-related transactions. People are the platform, not Big Tech or governments. Decentralized identity leverages technologies like blockchain and token-based economy to ensure that the identity infrastructure is not controlled by a few companies. Despite the decentralized infrastructure, federated identity enables identity information to be easily aggregated, processed, and available to use for good. Personalized identity individualizes the sharing of identity information (e.g. selective disclosure) based on different people's needs and sharing contexts. Secure identity makes sure that the access to identity information will be more secure, authenticated, and accountable. And lastly, intelligent identity helps streamline identity management.

XR Interactions for Water Sustaining Behavior Change
8:00 am - 9:00 am
By Suzan Oslin, Experiential Director at UXXR Design & Elektra Grant, Lecturer at California State Polytechnic University-Pomona & Laura Garcia, Director of Clinical Design at Found
XR Interactions for Water Sustaining Behavior Change
Suzan Oslin
Elektra Grant
Laura Garcia
UXXR Design
California State Polytechnic University-Pomona
Found
Experiential Director
Lecturer
Director of Clinical Design
Emerging Tech

Immersive media has been shown to be more effective for learning. It captivates the mind with interactions, involves the whole body through movement, requires eye-hand coordination, absorbs attention with learning by doing, and engages more of the senses. It’s a much stronger sensory experience, it's more memorable, and provides a greater opportunity to lead a participant through some form of transformation. Gamification becomes possible and reinforcements, both positive and negative, can be used to shape behavior. Concrete Oasis is a persistent, city-scale, augmented reality, civic exhibit intended for Pershing Square Park that addresses the water crisis with the intention of engaging the audience with interactions that will lead to lasting behavior change. Hear from the team designing this immersive exhibit about how they are using augmented reality to help shift the collective mindset towards a sustainable water future for LA.

Panels

9:30 am
10:00 am
10:30 am
11:00 am
11:30 am
12:00 pm
12:30 pm
1:00 pm
1:30 pm
2:00 pm
2:30 pm
3:00 pm
3:30 pm
4:00 pm
4:30 pm
Panels
Panels
Accessibility and Data Science
10:15 am - 11:15 am
Accessibility and Data Science
10:15 am - 11:15 am
Accessibility is about designing with usability for all people in mind. When we think about accessibility in data science most data scientists are familiar with using color-blind friendly colors in their data visualizations, but for many, thinking of examples and discussion beyond this is hard. We as data scientists need to design for more diverse populations to make a real impact on the world.
Joe Devon
Jennison Asuncion
Stephan G. Erberich, PhD
John Sweet
Founding partner at Diamond and Co-Founder of GAAD
Co-Founder and Vice-Chair at GAAD
Chief Data Officer & Director Biomedical Informatics at Children's Hospital Los Angeles
Sr. Director, Accessibility and Technology Compliance at Pluto TV
10:15 AM - 11:15 AM
Bovard

Accessibility is about designing with usability for all people in mind. When we think about accessibility in data science most data scientists are familiar with using color-blind friendly colors in their data visualizations, but for many, thinking of examples and discussion beyond this is hard. We as data scientists need to design for more diverse populations to make a real impact on the world.

Today’s Media and Entertainment Industry and Your Data Skills
1:30 pm - 2:30 pm
Today’s Media and Entertainment Industry and Your Data Skills
1:30 pm - 2:30 pm
Top media and entertainment executives from major Studio, Streaming, Gaming, Advertising, and Social Media companies come together to discuss how they use structured datasets and their derivative insights to drive key business decisions that have forever changed how we consume entertainment content. These senior leaders will also share what data, expertise, and innovative thinking is needed both now and for many years on. Come see if you have some of the skills or desire to help the industry get to the future faster.
Xavier Kochhar
Keith Camoosa
Tracey Trench
Eric Wilker
COO and CFO at pocket.watch
Chief Addressability Officer at Dentsu
Founder/CEO at T2 Content Ventures
Head of Partnerships and Business Development at Amazon/ IMDB
1:30 PM - 2:30 PM
Bovard

Top media and entertainment executives from major Studio, Streaming, Gaming, Advertising, and Social Media companies come together to discuss how they use structured datasets and their derivative insights to drive key business decisions that have forever changed how we consume entertainment content. These senior leaders will also share what data, expertise, and innovative thinking is needed both now and for many years on. Come see if you have some of the skills or desire to help the industry get to the future faster.

Startup Showcase Winners
2:45 pm - 3:15 pm
Startup Showcase Winners
Aware Health
Griot
ServiceMob
2:45 PM - 3:15 PM
Bovard

Startup Showcase Winners 1st Place - Aware Health 2nd Place - Griot 3rd Place - ServiceMob

Coming to terms with Data Privacy laws
3:30 pm - 4:30 pm
Coming to terms with Data Privacy laws
3:30 pm - 4:30 pm
Despite numerous proposals over the years, there is no one comprehensive federal law that governs data privacy in the U.S. There's a complex patchwork of sector-specific and medium-specific laws, including laws and regulations that address telecommunications, health information, credit information, financial institutions and marketing. The U.S. has hundreds of sectoral data privacy and data security laws among its states. U.S. state attorneys general oversee data privacy laws governing the collection, storage, safeguarding, disposal and use of personal data collected from their residents, especially regarding data breach notifications and the security of Social Security numbers. Some apply only to governmental entities, while others apply only to private entities, and some apply to both. The most comprehensive state data privacy legislation to date is the California Consumer Privacy Act (CCPA). Signed into law on June 28, 2018, it went into effect on January 1, 2020. The CCPA is cross-sector legislation that introduces important definitions and broad individual consumer rights and imposes substantial duties on entities or persons that collect personal information about or from a California resident.
Jeanne Holm
Martin Devon
Parker Lowrey
Deputy Mayor at City of Los Angeles
Founder at Autonomous Digital Healthcare Transformation
SVP of Data at Whip Media
3:30 PM - 4:30 PM
Bovard

Despite numerous proposals over the years, there is no one comprehensive federal law that governs data privacy in the U.S. There's a complex patchwork of sector-specific and medium-specific laws, including laws and regulations that address telecommunications, health information, credit information, financial institutions and marketing. The U.S. has hundreds of sectoral data privacy and data security laws among its states. U.S. state attorneys general oversee data privacy laws governing the collection, storage, safeguarding, disposal and use of personal data collected from their residents, especially regarding data breach notifications and the security of Social Security numbers. Some apply only to governmental entities, while others apply only to private entities, and some apply to both. The most comprehensive state data privacy legislation to date is the California Consumer Privacy Act (CCPA). Signed into law on June 28, 2018, it went into effect on January 1, 2020. The CCPA is cross-sector legislation that introduces important definitions and broad individual consumer rights and imposes substantial duties on entities or persons that collect personal information about or from a California resident.

Tutorials

1:30 pm
2:00 pm
2:30 pm
3:00 pm
3:30 pm
Tutorials
Tutorials
Intro to Data Science
1:30 pm - 3:30 pm
By Zia Khan, Computer Systems Analyst and Data Scientist at LearningFuze
Intro to Data Science
1:30 pm - 3:30 pm
Data Science tutorial is designed for people who are new to Data Science. This is a beginner level session so no prior coding or technical knowledge is required. Just bring your laptop with WiFi capability. The session starts with a review of what is data science, the amount of data we generate and how companies are using that data to get insight. We will pick a business use case, define the data science process, followed by hands-on lab using python and Jupyter notebook. During the hands-on portion we will work with pandas, numpy, matplotlib and sklearn modules and use a machine learning algorithm to approach the business use case.
Zia Khan
LearningFuze
Computer Systems Analyst and Data Scientist
THH 102

Data Science tutorial is designed for people who are new to Data Science. This is a beginner level session so no prior coding or technical knowledge is required. Just bring your laptop with WiFi capability. The session starts with a review of what is data science, the amount of data we generate and how companies are using that data to get insight. We will pick a business use case, define the data science process, followed by hands-on lab using python and Jupyter notebook. During the hands-on portion we will work with pandas, numpy, matplotlib and sklearn modules and use a machine learning algorithm to approach the business use case.

Organizers

Dimas Soeprapto

Dimas Soeprapto

Manager, Advanced Analytics at Edwards Lifesciences

Eli Selkin

Eli Selkin

Co-Founder & CTO at upful.ai

Germain Louie

Germain Louie

Sr. Private Equity Alliances Manager at Sage Intacct

Hana Ueda

Hana Ueda

Volunteer at Data Con LA

Hestia Sartika

Hestia Sartika

NV/CA Regional VP of Technology at Phi Theta Kappa Honor Society International

Hochan Son

Hochan Son

Sr. SRE - Database at Branch

Kaloyan Todorov

Kaloyan Todorov

Managing Sr. Data Scientist at FIGS

Kyle Walker

Kyle Walker

Senior Principal Software Engineer on Data at ZEFR

Murali Kadambala

Murali Kadambala

Sr. Director, Finance(Technology) at LA Metro

Nancy Melucci

Nancy Melucci

Lecturer at CSUDH

Oszie Tarula

Oszie Tarula

Online Marketing and Events Consultant at Hola Metaverso

Randall Chalawsky

Randall Chalawsky

Management Consultant at Great Land Media Group

Rich Ung

Rich Ung

Data Scientist at Disney Media & Entertainment Distribution

Ritesh Dedhia

Ritesh Dedhia

Director, Data and Analytics at Warner Bros Discovery

Robbie Smith

Robbie Smith

Lead Business Data Analyst at Thales Avionics

Ruben Barrios

Ruben Barrios

Information Security Student at Red River College

Sanjeev Sehgal

Sanjeev Sehgal

SVP, Sales at Satwic

Stephen Kwok

Stephen Kwok

Staff Scientist, Bioinformatics at Singular Genomics

Subash D’Souza

Subash D’Souza

Director, Cloud Data Engineering at CSU - Chancellor's Office, Founder at Data Con LA & Data 4 Good

Volunteers

Ahmad Alach

Ahmad Alach

Second-year medical student at Kaiser Permanente Bernard J. Tyson School of Medicine

Angelina Ha

Angelina Ha

AI/ML & Analytics Sales Specialist, Strategic Accounts at AWS

Apurva Gupta

Apurva Gupta

Graduate Student at USC

Asmita Chotani

Asmita Chotani

Co Lead - Los Angeles Chapter at Women In Data

Belle Shen

Belle Shen

Data Science Teaching Assistant at LearningFuze

Carlos Olivas

Carlos Olivas

Software Developer at SAIC

Charlie Chang

Charlie Chang

Technical Program Manager at Advanced Semiconductor Engineering

Courtney Fowler

Courtney Fowler

Supply Chain Operations Manager at Flexport

Dolapo Kester, CSM

Dolapo Kester, CSM

Analyst at Warner Bros Discovery

Hari Krishnan Umapathy

Hari Krishnan Umapathy

Sr. Manager, Data Engineering at Beachbody

Jason Kang

Jason Kang

Software Engineer at NASA-JPL

Joana Perdomo

Joana Perdomo

Systems Engineer at Raytheon Intelligence & Space

Karintha Marshall

Karintha Marshall

Director, Information Systems at Wilshire Boulevard Temple

Mawada Felemban

Mawada Felemban

MSBA Student at USC

Mitzi Bandera

Mitzi Bandera

Data Analyst at Foundation for California Community Colleges

Mrinmoy Das

Mrinmoy Das

Staff Data Engineer at Warner Bros Discovery

Naveena Parasa

Naveena Parasa

Data Engineer at Bank Of America

Nikhil Jindal

Nikhil Jindal

CS Grad Student at USC

Parth Kapadia

Parth Kapadia

CS Grad Student at USC

Priya Chauhan

Priya Chauhan

Senior Director of External Relations at First Source

Riti Agrawal

Riti Agrawal

CS Grad Student at USC

Ronald Walcott

Ronald Walcott

Director at Arts Collaborative Central

Sadman Rahi

Sadman Rahi

MSBA Student at USC

Shana Sanders

Shana Sanders

Audio, Video, Security Systems Technician at Los Angeles County Internal Services Department

Tanmay Dhaundiyal

Tanmay Dhaundiyal

Graduate Student at USC

Tariq Rahiman

Tariq Rahiman

Quality Assurance Manager at Pangea

Unnati Singhal

Unnati Singhal

CS Grad Student at USC

Viva Fung

Viva Fung

Lead Data Engineer at AXS

Data Con LA 2021 Videos

Data Con LA 2021 Videos

Illustration of presentation
The Largest Data Conference in Southern California.

Spearheaded by Subash D’Souza and organized and supported by a community of volunteers, sponsors and speakers, Data Con LA features the most vibrant gathering of data and technology enthusiasts in Los Angeles.

Data Con LA began as Big Data Camp LA in 2013, with just over 250 attendees. It was rebranded to Big Data Day LA in 2014 with over 550 attendees in 2014, 950+ attendees in 2015, 1200+ attendees in 2016, and 1550+ attendees in 2017. In 2018, we re-branded ourselves from Big Data Day LA to Data Con LA with over 1800 attendees and over 2000 in 2019. In response to the COVID-19 pandemic, DCLA had its first successful virtual conference in 2020 with over 1000 virtual attendees. DCLA was virtual again in 2021 with over a 1000 attendees.

Data Con 2021 by the numbers

1000+
Attendees
100+
Speakers
100+
Sessions

2022 Session Tracks

  • AI/ ML/ Data Science
  • Data Engineering
  • Data Infrastructure & Security
  • Emerging Tech
  • Data 4 Good
  • BI/ Reporting/ Business Use Cases

List of panels, sessions and more will be available soon. Check back for more details.