At a time when tackling environmental challenges is of paramount importance, the cleantech industry plays a central role in promoting sustainable solutions. However, technological innovation in the cleantech sector requires a deep understanding not only of the technologies but also of market requirements. This information is usually embedded in large volumes of patent and media data, which are difficult to analyze manually when trying to capture development trends. Natural Language Processing (NLP) and the latest advances in Large Language Models (LLMs) are a natural choice for analyzing this data and accelerating innovation. In this workshop, we will share the insights we gained in tackling this task. Several presentations on relevant topics will be followed by a hands-on session where participants can try out our LLM-powered cleantech question-answering and recommendation system.


  • Promote the next generation of NLP enthusiasts and innovators in the cleantech industry
  • Show how NLP and LLMs can be used to accelerate innovation with media and patent text data


09:00 – 09:10 Introduction
09:10 – 10:00 Decoding cleantech
Expert techniques for analyzing patent and media data: NLP for innovation intelligence, LLMs and their application in the cleantech industry, retrieval-augmented generation (RAG) and its application to cleantech innovation, and LLM-augmented recommender systems for cleantech innovation
10:00 – 10:30 Emerging visions in cleantech
Insights from patent and media data
10:30 – 11:00 Break
11:00 – 12:30 NLP in action
Practical coding for patent and media data analysis in cleantech