Latent Space: The AI Engineer Podcast

Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)

Aug 22, 2024
Alistair Pullen, co-founder and CEO of Cosine, discusses Cosine Genie, the top coding agent, which is built on a fine-tuned GPT-4o. He describes the training techniques that let the model learn from the work of real software engineers and improve coding efficiency. The conversation also covers the challenges of fine-tuning models, the importance of synthetic data, and future directions in AI tooling for software development.
ANECDOTE

Importance of Data Cleaning

  • Early versions of Cosine's model, trained on uncleaned data, exhibited unwanted behaviors such as arguing in code reviews.
  • This highlighted the importance of cleaning the training data and aligning it with the desired model behavior.
INSIGHT

Prioritizing Core Features

  • While helpful, web browsing is less critical than retrieving relevant files when building coding agents.
  • Focus on fundamental tasks before adding complex tools.
INSIGHT

Semantic Code Search

  • Semantic code search is difficult due to the difference between code and natural language.
  • Training a model to translate natural language queries into code snippets improves retrieval accuracy.
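
The idea in the snip above can be sketched as follows. This is a minimal illustration, not Cosine's implementation: the trained translation model is replaced by a hypothetical canned lookup (`translate_query_to_code`), the file names and snippets are made up, and similarity is a plain bag-of-tokens cosine score rather than a learned embedding.

```python
import math
import re
from collections import Counter

# Made-up corpus standing in for an indexed codebase (assumption).
SNIPPETS = {
    "auth.py": "def verify_password(user, password):\n"
               "    return hash_password(password) == user.password_hash",
    "cart.py": "def add_item(cart, item):\n"
               "    cart.items.append(item)",
}


def tokenize(text):
    # Lowercase the text and keep runs of letters and underscores as tokens.
    return re.findall(r"[a-z_]+", text.lower())


def cosine(a, b):
    # Cosine similarity between two token-count vectors.
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def translate_query_to_code(query):
    # Hypothetical stand-in for a trained model that rewrites a natural
    # language question as the code it expects to find.
    canned = {
        "where do we validate login credentials?":
            "def verify_password(user, password): "
            "return hash_password(password) == user.password_hash",
    }
    return canned.get(query.lower().strip(), query)


def search(query, snippets):
    # Compare the translated, code-like query against every snippet and
    # return the name of the best match.
    query_vec = Counter(tokenize(translate_query_to_code(query)))
    return max(
        snippets,
        key=lambda name: cosine(query_vec, Counter(tokenize(snippets[name]))),
    )


QUERY = "Where do we validate login credentials?"
best_match = search(QUERY, SNIPPETS)
# Score of the untranslated question against the same file, for comparison.
raw_score = cosine(Counter(tokenize(QUERY)), Counter(tokenize(SNIPPETS["auth.py"])))
```

In this toy corpus the raw question shares no tokens with `auth.py`, so matching on the question text alone gives no signal, while the code-like translation overlaps heavily with the right file. That gap between natural language and code vocabulary is what the snip describes.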