Giving My Wife a Voice with Ruby and AI Cloning (+ Discussion on The AI and Ruby Landscape)

Giving My Wife a Voice with Ruby and AI Cloning highlights Kane Hooper's journey in developing an AI voice cloning application to help his wife, Peggy, regain her ability to communicate after losing 80% of her speech capacity due to a neurological condition. The talk takes place at RubyConf AU 2024 and tackles the intersection of AI technology and Ruby programming.

Key Points Discussed:

- Personal Story: Kane opens with personal anecdotes about his wife, a trained classical singer who was diagnosed with a voice-affecting condition. He shares emotional moments including a song recorded by her before she lost her voice.

- Technology Overview: The presentation progresses to discuss advancements in AI voice cloning technology. Kane utilizes 11 Labs' service, showcasing differences between AI-generated voices and genuine recordings. He emphasizes the availability of quality recordings for more accurate cloning results.

- Challenges and Developments: Kane shares ongoing development for an application that enables his wife to communicate by phone, particularly with call centers, demonstrating the potential of AI in real-life situations. He conducts a live demo that illustrates the voice cloning in action.

- Ethical Concerns of AI: The dark side of AI voice cloning is addressed, including risks associated with deep fakes and identity fraud. Kane recounts real-life examples where AI cloning caused significant losses to companies, stressing the importance of vigilance and validation from multiple sources.

- Technical Architecture: Kane elaborates on the technical details of his application, utilizing Ruby for backend development, integrating Twilio for calls and 11 Labs for voice processing. He simplifies the process to just an API call, making the technology accessible to Ruby developers.

- Open-Source and Future Prospects: The conversation expands to the broader AI landscape and the emergence of open-source tools, encouraging Ruby developers to embrace AI technologies for innovation and contribution.

- Prompting and Knowledge Retrieval in AI: Kane explains the nuances of prompting AI systems and how effective instructions yield precise responses. He discusses the potential of AI in context-based knowledge retrieval, enhancing user experiences in various applications.

- Conclusion: Kane concludes with the message that AI is within reach for Ruby developers, inviting collaboration to harness its capabilities for societal benefits. He offers consultations for developers looking to integrate AI into their projects, aiming to inspire a community-focused approach to innovation.

Overall, the presentation emphasizes a blend of personal narrative, technical insight, and ethical considerations in the rapidly evolving field of AI and its practical applications in programming with Ruby.

Giving My Wife a Voice with Ruby and AI Cloning (+ Discussion on The AI and Ruby Landscape)
Kane Hooper • April 11, 2024 • Sydney, Australia

Last year my wife was diagnosed with a neurological condition causing her to lose 80% of her speech capacity. Through the use of AI voice cloning technology and Ruby in the backend I was able to build an application that has given her back her voice. In this talk I will walk through the AI voice cloning technology and the key Ruby code to build an application that utilises her own voice.

We will also look at the overall AI landscape, what tools and technologies (particularly open-source) can be integrated into Ruby applications.

RubyConf AU 2024