I am sure you have been reading about Hadoop (an Apache open source project), MapReduce, and other new technologies being used to process and analyze large datasets at some of the leading internet companies: Google, Yahoo, Myspace, etc. The datasets in our companies continue to grow exponentially in size. Business requires an even quicker turnaround of analysis and reports. We must understand and evaluate these technologies as well!
Luke Lonergan and Brian Dolan traveled to DC from the West Coast and lead a discussion on Large Scale Data Analytics. We reviewed how MySpace is using Hadoop and Greenplum to analyze petabyte scale data. The emphasis was on tools, processes, algorithms and best practices. Advertising on the internet has proven to be one of the most profitable businesses in history and it’s all about using data to deliver the most efficient ads – something Brian Dolan has focused on since the launch of ad operations at FIM / MySpace.
Luke Lonergan (co-founder and CTO at Greenplum): Prior to Greenplum, Luke founded Didera, a database clustering company, in 2000 and served as CEO and Chairman. Luke’s background includes 16 years of management experience in computing technology ranging from innovations in supercomputing to advances in medical imaging systems. Most recently, he directed data center integration at High Performance Technologies Inc (HPTi), scaling the business to $30M, and setting industry firsts in parallel computing subsequently adopted by IBM and Compaq. Previously he held management positions at Northrop Grumman Corporation. He holds an M.S. in Aeronautics and Astronautics from Stanford University and a B.E. in Mathematics from Vanderbilt University.
Brian Dolan (Director of Research Analytics at Fox Interactive Media / MySpace): Brian directs business analytics at Fox Interactive Media and is responsible for targeted advertising analytics. His specialties include time series analysis, applied stochastic processes and optimization. He has over 17 years of analytic experience at Yahoo and MySpace where he has focused on the application of high-dimensional analysis of large data sets, especially longitudinal analysis, to solve challenges in audience based revenue targeting.