In WhatsApp, users can send a text message, an image, or both at once. Typebot, on the other hand, provides separate text and image inputs, with no combined option. Is there any workaround for this in Typebot? (I'm using GPTVision.)
Actually, my problem goes deeper. I can't send any file from WhatsApp (through Evolution API) to Typebot. Is it possible? Maybe it's only possible with the official API?
With the official API, can I do what I'm trying to do? That is, let the user send text, an image, or text + image together? I want to give the user the option to send images (GPTVision).
This is a problem with Evolution. AFAIK, Evolution API's handling of the responses coming from Typebot is very limited, and it isn't able to handle and format all the data from Typebot.
I'm not sure whether Evolution isn't even sending the message to Typebot (because of the unknown format), or whether it is sending it and Typebot is the one returning the error.
I'm thinking about adding a layer between Typebot and whatever sits in front of it (Evolution at the moment, but I'm not sure what the best option is). That way Typebot will always receive a JSON like { text: 'xxx', image: 'url' }.
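To make the idea concrete, here is a rough sketch of what that layer's core could look like. Everything in it is an assumption for illustration: `IncomingMessage` is a made-up shape, not the real Evolution API (or official API) payload, and `normalizeMessage` is a hypothetical helper. The only part taken from the idea above is the output, which always has a `text` and an `image` field.

```typescript
// Hypothetical shape of what the gateway in front of Typebot delivers.
// This is NOT the real Evolution API payload; adapt the fields to what you actually receive.
interface IncomingMessage {
  text?: string | null;     // plain message body, if any
  caption?: string | null;  // caption attached to a media message, if any
  mediaUrl?: string | null; // URL of the uploaded image, if any
}

// The fixed shape the bot would always receive.
interface NormalizedInput {
  text: string;         // empty string when the user sent only an image
  image: string | null; // null when the user sent only text
}

function normalizeMessage(msg: IncomingMessage): NormalizedInput {
  // Prefer the message body; fall back to the media caption; default to "".
  const text = msg.text?.trim() || msg.caption?.trim() || "";
  // Treat a missing or blank URL as "no image".
  const image = msg.mediaUrl?.trim() || null;
  return { text, image };
}

// Example: an image sent with a caption becomes text + image together.
const example = normalizeMessage({
  caption: "What is this?",
  mediaUrl: "https://example.com/cat.jpg",
});
console.log(JSON.stringify(example));
```

With this in place, the flow only has to handle one input format and can check whether `image` is null before deciding to call the vision model.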
Typebot also supports files, as we already have the File Upload block. However, it currently lacks a component that lets users type text, upload a file, or combine text and a file in a single input, similar to the interface I'm using here to type this message.
With the rising popularity of GPTVision, coupled with the fact that it is becoming more cost-effective, I believe that all agents will inevitably transition in this direction.