Jiawei Han,

(Springer - SNAM Journal Keynote)
Michael Aiken Chair Professor
University of Illinois at Urbana-Champaign
Mining Structured Knowledge from Massive Unstructured Text

Abstract

The real-world big data are largely dynamic, interconnected and unstructured text. It is highly desirable to transform such massive unstructured data into structured knowledge. Many researchers rely on labor-intensive labeling and curation to extract knowledge from such data. Such approaches, however, are not scalable. We vision that massive text data itself may disclose a large body of hidden structures and knowledge. Equipped with pretrained language models and text embedding methods, it is promising to transform unstructured data into structured knowledge. In this talk, we introduce a set of methods developed recently in our group for such an exploration, including joint spherical text embedding, discriminative topic mining, taxonomy construction, text classification, and taxonomy-guided text analysis. We show that data-driven approach could be promising at transforming massive text data into structured knowledge.

Biography

Jiawei Han (Fellow, IEEE) received the B.S. degree from the University of Science and Technology of China, and the Ph.D. degree from the University of Wisconsin-Madison, all in computer science. He is currently Michael Aiken Chair Professor with the Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL, USA. He received ACM SIGKDD Innovation Award (2004), IEEE Computer Society Technical Achievement Award (2005), IEEE Computer Society W. Wallace McDowell Award (2009), and Japan's Funai Achievement Award (2018). He is a fellow of ACM and served as the Co-Director of KnowEnG, a Center of Excellence in Big Data Computing (2014–2019), funded by NIH Big Data to Knowledge (BD2K) Initiative and as the Director of Information Network Academic Research Center (INARC) (2009–2016) supported by the Network Science-Collaborative Technology Alliance (NS-CTA) program of U.S. Army Research Lab.(Based on document published on 3 July 2020).

Daniel A. Keim

University of Konstanz, Germany
Visual Analytics of Network Data: Challenges and Applications

Abstract

Never before in history, network and social media data have been generated and collected at such high volumes as it is today. As the volumes of data available to companies, scientists, and the public increase, their effective use becomes more challenging. Keeping up to date with the flood of data, using standard tools for data analysis and exploration, is fraught with difficulty. The field of visual analytics seeks to provide people with better and more effective ways to understand and analyze large datasets, while also enabling them to immediately act upon their findings. Visual analytics integrates the analytic capabilities of the computer and the abilities of the human analyst, allowing novel discoveries and empowering individuals to take control of the analytical process. The talk presents the challenges of visual analytics and exemplifies them with several application examples from the areas of network and social media analysis, illustrating the exiting potential of visual analysis techniques but also their limitations.

Biography

Daniel A. Keim is professor (Chair of Data Analysis and Visualization) in the Computer Science department at the University of Konstanz, Germany. He has been actively involved in data analysis and information visualization research for more than 30 years and received the 2011 Visualization Technical Achievement Award in recognition of his seminal technical work in high-dimensional data analysis and visualization of large data bases, which has stimulated research in the new field of Visual Analytics. Daniel is associate editor of The Visual Computer and the Information Visualization Journal and co-author of several books dedicated to Interactive Data Visualization. Dr. Keim got his Ph.D. and habilitation degrees in computer science from the University of Munich. Before joining the University of Konstanz, Dr. Keim was associate professor at the University of Halle, Germany and Senior Technology Consultant at AT&T Shannon Research Labs, NJ, USA.

Amit Sheth,

University of South Carolina, USA
Knowledge-infused NLU for Addiction and Mental Health Research

Abstract

With the increasing legalization of medical and recreational use of substances, more research is needed to understand the association between mental health and user behavior related to drug consumption. Specifically, drug overdose and substance use- related mental health issues have become two major topics that have been widely discussed on social media platforms. Big social media data has the potential to provide deeper insights about these associations to public health analysts for making policy decisions. Multiple national population surveys have found that about half of those who experience a mental health illness during their lives will also experience a substance use disorder and vice versa. The communications related to addiction and mental health are complex to process and understand given their language and contextual characteristics. Surface-level data analysis alone is not sufficient to understand the complex nature of relationships among the addiction and mental health context. Moreover, dark web vendors have been using social media as a new marketplace for drugs. Social media users also discuss the novel drugs emerging in dark web marketplaces and associated side effects/health conditions. These communications get complex when researchers try to annotate them or link them to a specific mental health entity. Considering the significant sensitivity of such communications and to protect user privacy on social media, a potential solution requires reliable algorithms for modeling such communications. We demonstrate the value of incorporating domain-specific knowledge in natural language understanding to identify the relationship between mental health and drug addiction. We discuss end-to-end knowledge-infused deep learning frameworks that leverage the pre-trained language representation model and domain-specific declarative knowledge source to extract entities and their relationships jointly. Our model is further tailored to focus on the entities mentioned in the sentence where ontology is used to locate the target entity, position. We also demonstrate the capabilities of inclusion of the knowledge-aware representation in association with language models that can extract the Drug-Mental health condition associations. Acknowledgements: Usha Lokala, Raminta Daniulaityte, Francois Lamy, Manas Gaur, Jyotishman Pathak, and collaborators on NIDA/NIH and NSF funded projects on Addiction and Mental Health.

Biography

Prof. Amit Sheth (Home Page, LinkedIn) is an Educator, Researcher, and Entrepreneur. He is the founding director of the university-wide AI Institute at the University of South Carolina. He is a Fellow of IEEE, AAAI, AAAS, and ACM. He has (co-)founded five companies, including the first Semantic Search company in 1999 that pioneered technology similar to what is found today in Google Semantic Search and Knowledge Graph, ezDI, which developed knowledge-infused clinical NLP/NLU, and Cognovi Labs at the intersection of emotion and AI.