In my experience, what makes gpt-4 great for coding is its astonishing knowledge of available software libraries, built-in interface features, etc.
I’ll tell it the task I want done, and it will tell me where to find, and how to install the necessary dependencies.
With zero experience in browser extension design, gpt-4 helped me to build an incredibly complicated Chrome extension, using vector database; creating a custom, cloud-based server; web scraping with headless browsers, voice recognition, speech synthesis, wake-word capabilities, and a sophisticated user interface. I had ZERO experience with ANY of these.
For me, using gpt-4 was like collaborating with a just okay programmer, but one who had extensive experience with literally every programming language, API, protocol, etc.
And it was a collaboration. We would talk through problems together. I would make observations and guesses about why a block of code wasn’t working, and it would tell me why I was wrong, or alternately tell me I was right, and produce a fixed version.
I’m not ready to talk about it in detail. Even my boss doesn’t know. But you’re in the right ballpark.
I’m actually building a proof-of-concept prototype for what I want to work on… and I’m using a browser extension so that I can build it independently without anyone from the tech team being involved and slowing me down.
That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.
In my experience, what makes gpt-4 great for coding is its astonishing knowledge of available software libraries, built-in interface features, etc.
I’ll tell it the task I want done, and it will tell me where to find, and how to install the necessary dependencies.
With zero experience in browser extension design, gpt-4 helped me to build an incredibly complicated Chrome extension, using vector database; creating a custom, cloud-based server; web scraping with headless browsers, voice recognition, speech synthesis, wake-word capabilities, and a sophisticated user interface. I had ZERO experience with ANY of these.
For me, using gpt-4 was like collaborating with a just okay programmer, but one who had extensive experience with literally every programming language, API, protocol, etc.
And it was a collaboration. We would talk through problems together. I would make observations and guesses about why a block of code wasn’t working, and it would tell me why I was wrong, or alternately tell me I was right, and produce a fixed version.
So what are you building? A browser STT interface for chatting with GPT and other LLMs?
I’m not ready to talk about it in detail. Even my boss doesn’t know. But you’re in the right ballpark.
I’m actually building a proof-of-concept prototype for what I want to work on… and I’m using a browser extension so that I can build it independently without anyone from the tech team being involved and slowing me down.
That sounds nice. I’ve been looking at serenade.ai and thought about extending their STT with an option to use another third-party STT engine. I would then like to extend their command engine with LLM command recognition. In my experience, maybe also with my pronunciation as a non-english speaker, their STT and command recognition really doesn’t work that well.
Have you tried Whisper from OpenAI? It’s the best I’ve ever seen. I’m curious how it would handle accents.
No, not yet. But thanks for the tip!