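Multimodal AI refers to models that accept or produce more than one type of data, such as text, images, and audio. GPT-4 with vision (GPT-4V), for example, accepts both text and images as input.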