Guang-Jie Ren is a Principal Research Scientist at IBM Almaden Lab and currently manages a team of researchers and engineers working on the next-generation data platform for enterprise-grade foundation models. Prior to the current role, Guang-Jie led a multi-disciplinary team and built a series of user-centered industry applications, including mobile-conversation app for business travel, web-conversation app for travel shopping, conversation UX design, conversation anywhere agents, next-generation business modeling tool, and enterprise match-making platform. Trained as an industrial engineer, Guang-Jie received his PhD from the University of Cambridge for research work on service business transformation and joined IBM Research in 2009 for contribution to the emergence of Service Science.
Data management for enterprise-grade foundation models: opportunities and challenges.
Data is the fuel that powers foundation models. And to ensure data quality and mitigate various risks for enterprise use, massive datasets have to be properly acquired, cleared and preprocessed, with appropriate access control, data versioning and metadata collection. In this talk, we’ll provide an overview of IBM’s approach to building enterprise-grade foundation models, dive into details about datasets and pipelines, and discuss the use of data lakehouse for opportunities and challenges in data management.