Langchain agent
Empower Langchain agents to interact with the real world via phone calls
Introduction
This example shows how to use Vocode as a tool augmenting the abilities of a Langchain agent. By providing it with access to Vocode, a Langchain Agent can now make autonomous phone calls and take action based on the outcome of the calls.
Our demo will walk through how to instruct the agent to lookup a phone number and make the appropriate call.
How to run it
Requirements
Run the example
Note: gpt-4
is required for this to work. gpt-3.5-turbo
or older models are not smart enough to parse JSON responses in Langchain agents reliably out-of-the-box.
To get started, clone the Vocode repo or copy the Langchain agent app directory.
Environment
- Copy the
.env.template
and fill in your API keys. You’ll need:
- Deepgram (for speech transcription)
- OpenAI (for the underlying agent)
- Azure (for speech synthesis)
- Twilio (for telephony)
- Tunnel port 3000 to ngrok by running:
Fill in the TELEPHONY_SERVER_BASE_URL
environment variable with your ngrok base URL: don’t include https://
so should be something like:
- Buy a phone number on Twilio or verify your caller ID to use as the outbound phone number.
Set this phone number as the
OUTBOUND_CALLER_NUMBER
environment variable. Include+
and the area code, so for a US phone number, it would look something like.
Set up self-hosted telephony server
Run the following setups from the langchain_agent
directory.
Running with Docker
- Build the telephony server Docker image
- Run the service using
docker-compose
Running with Python
- (optional) Set up a Python environment: we recommend
virtualenv
- Install requirements
- Run an instance of Redis at http://localhost:6379. With Docker, this can be done with:
- Run the
TelephonyServer
:
Set up the Langchain agent
With the self-hosted telephony server running:
- Update the phone numbers in the contact book in
tools/contacts.py
- Run
main.py
Code explanation
🚧 Under construction