Text or Image Input

At a glance

The community members are discussing the limitations of TypeBot in handling combined text and image inputs, which is a feature available in WhatsApp. They explore the possibility of using the official WhatsApp API to achieve this functionality, but there are concerns about the compatibility with the Evolution API. The community members suggest adding a layer between TypeBot and the API to format the input as a JSON object with separate text and image fields. However, they acknowledge that TypeBot's current API only supports text input, and they are unsure how to support "hybrid messages" that combine text and images.

VVilela™

Within Whatsapp, users have the option to choose between sending a text message, an image, or both. TypeBot, on the other hand, provides separate text and image inputs, lacking a combined option. Is there any workaround to this in TypeBot?
(I'm using GPTVision)

15 comments

VVilela™

Actually my problem is deeper.
Can't sent any file from Whatsapp (thought Evolution API) to TypeBot.
Is it possible?
Maybe just possible with the official API?

VVilela™

☹️

BBaptiste

Can't tell for Evolution as it's not officially supported (you'll have to ask them)

BBaptiste

But with the official WA integration, you can definitely collect files. They will be converted as URL

VVilela™

With the official API can I do what I'm trying to?
Let the user and text, image or text + image together?
I want to give the user the option to send images (GPTVision).

LLuizAlves

This is a problem with evolution. Afaik, the evolution api treatment response coming from type bot is very limited and doesn’t able to handle and format all data from typebot.

VVilela™

I'm not sure if Evolution is not even sending the message to Typebot (cause the unknow format) or if it's sending to Typebot and Typebot is the one returning the error.

BBaptiste

Typebot doesn't know how to process image + text so this won't be possible.

BBaptiste

How do you expect it to behave in that case?

VVilela™

I'm thinking about add a layer between TypeBot and whatever (Evolution at this moment, but now sure what is the bets option).
So Typebot will always receive a JSON like { tex: 'xxx', image: 'url' }.

BBaptiste

Currently the Typebot API only understand some text as input

BBaptiste

I'm really not sure how we could support hybrid messages 🤔

VVilela™

It also supports files, as we already have the File Upload block. However, it currently lacks a component that allows users to input text, upload a file, or combine text and file inputs—a feature similar to the interface I'm currently using here to type this message.

VVilela™

Attachment

VVilela™

With the rising popularity of GPTVision, coupled with the fact that it is becoming more cost-effective, I believe that all agents will inevitably transition in this direction.

Add a reply

Share feedback, ideas and get community help

Text or Image Input