Multimodal inputs #23

@dadahl

Buttons, radio buttons, checkboxes: could they just send an utterance, or should they send a distinct "button press" event?
Camera input, for example for biometrics: how would that work?
We could use the MIME type to tell the agent that it is receiving an image.
Images or audio could be encoded as Base64 ASCII text to send to the server.
Could WebRTC be helpful?
Does audio also need to specify an encoding along with the MIME type?
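
To make a few of these questions concrete, here is a minimal sketch of what a "button press" event and a Base64-encoded media payload could look like on the wire. The envelope and its field names (`type`, `controlId`, `mimeType`, `encoding`, `params`, `data`) are assumptions made up for illustration; nothing here has been agreed. The audio example uses the registered `audio/L16` media type, which by definition takes a `rate` parameter and an optional `channels` parameter. That suggests at least some audio formats do need more than the bare MIME type.

```python
import base64
import json


def button_event(control_id: str, value: str) -> dict:
    """A structured 'button press' instead of a synthesized utterance.

    Hypothetical shape: the field names are illustrative only.
    """
    return {"type": "button-press", "controlId": control_id, "value": value}


def media_event(data: bytes, mime_type: str, **params) -> dict:
    """Wrap raw image or audio bytes so they survive a JSON text channel.

    The MIME type tells the agent what it is receiving; `params` carries
    any extra format details (assumed field, e.g. sample rate for audio).
    """
    return {
        "type": "media",
        "mimeType": mime_type,
        "encoding": "base64",
        "params": params,
        "data": base64.b64encode(data).decode("ascii"),
    }


def decode_media(event: dict) -> bytes:
    """Recover the original bytes on the receiving side."""
    if event.get("encoding") != "base64":
        raise ValueError("unsupported transfer encoding")
    return base64.b64decode(event["data"])


# Stand-in for a short chunk of raw 16-bit PCM audio.
pcm = bytes(range(16))
audio = media_event(pcm, "audio/L16", rate=16000, channels=1)

# Both kinds of input travel in the same JSON message.
wire = json.dumps([button_event("confirm", "yes"), audio])
received = json.loads(wire)
```

Base64 turns every 3 bytes into 4 ASCII characters, so payloads grow by about a third. That is probably acceptable for a single image or a short audio clip, but it is one reason a streaming transport such as WebRTC might be worth considering for continuous audio or video.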
