|
演讲简介
By producing data on a scale unimaginable even a few years ago, the Web is transforming the way we learn about our own behavior. In this scenario, Data Science is emerging as a multidisciplinary endeavor that combines decades of knowledge from disparate fields, such as statistics, computer science, and the social sciences. This new science has the potential to deliver billions of dollars of value to the economy and a great deal of social capital to society.
The first part of my talk will focus on one particular domain of Data Science where I see a great wealth of opportunities to investigate social behavior: using the Web to engage volunteers to participate in large-scale online randomized experiments. I will present a case from collaborations with major companies in the sharing economy, namely Airbnb and Couchsurfing. Our results demonstrate that reputation systems can serve as mechanisms to reduce obstacles to the growth and to the healthy function of these businesses, such as social biases and suboptimal market matchings.
In the second part I will switch to discuss the foundations of Data Science. Even though in recent years there has been extraordinary progress in deriving business value from data, the field is still in its early stages; I see tremendous opportunities for growing and strengthening its foundations. A critical long-term goal of the Data Science community concerns the provision of rigorous guarantees on how much actionable information we can derive from a given amount of data. Our research shows evidence that this can be accomplished, as illustrated by the example of learning hidden network structures from records of information cascades.