Do LLMs dream of Type Inference?

The video titled "Do LLMs dream of Type Inference?" presented by Shunsuke "Kokuyou" Mori at RubyConf 2024 focuses on the potential of Large Language Models (LLMs) to assist in type inference for Ruby code. The session highlights the development and evaluation of RBS Goose, an LLM-based type inference tool, that aims to redefine how type inference is performed compared to traditional methods.

Key points discussed in the session include:

- Introduction to Type Inference: The speaker explains the significance of type inference in programming, emphasizing the challenges posed by dynamic typing in Ruby, where types are determined at runtime rather than compile time.

- Limitations of Traditional Approaches: Traditional static type checking often fails to recognize certain dynamic elements of Ruby, making it difficult to identify type errors before code execution. Examples highlight common pitfalls in type inference using algorithms that rely solely on data flow analysis, lacking human-like intuition.

- RBS Goose Development: The presentation details how RBS Goose infers Ruby's type definitions from code without explicit type annotations. Using LLMs like ChatGPT, RBS Goose replaces unknown or untyped elements in code with inferred concrete types by analyzing context and conventions, which aligns more with human understanding.
- Evaluation Challenges: The speaker acknowledges that while RBS Goose has shown improved performance over traditional methods in several cases, the lack of standardized metrics makes it challenging to assess its effectiveness comprehensively. Currently, evaluations are manual, making it difficult to track improvements over time.

- Benchmark Proposal - TypeEvalRb: Discussions include plans for establishing a more detailed evaluation methodology, inspired by previous studies in type inference, such as Styper for Ruby and Type EV Pi for Python. The proposed benchmarks will help quantify the performance of type inference tools like RBS Goose, focusing on the comparison of expected and inferred types.

- Future Directions: The session concludes with the speaker expressing optimism about refining the evaluation process for RBS Goose. By combining insights from past studies with new benchmarks, there is potential for significant advancements in LLM-based type inference tools in Ruby.

Overall, the talk provides an insightful overview of how Large Language Models can enhance type inference, addressing both current successes and the need for improved evaluation frameworks to measure progress in this innovative field.