Amazon’s Nova Is Another New Line of Multimodal AI Models

AWS introduced Amazon Nova, a suite of multimodal AI models, at its reinvent conference, promising to deliver advanced text, image, and video.

On Tuesday, Amazon Web Services (AWS) introduced Amazon Nova, a suite of multimodal AI models, at its reinvent conference, promising to deliver advanced text, image, and video generation capabilities.

The Nova AI generator family includes four models – Micro, Lite, Pro, and Premier – all Premier are currently available, with the release set for early 2025. Complementing these, advanced image and video generation capabilities are enabled by Nova Canvas and Nova Reel, respectively.

AWS CEO Andy Jassy highlighted the progress behind these models, stating, “If we were finding value out of them, you would probably find value out of them.”

Amazon Nova AI for Speed and Sophistication

The models cater to different needs: Micro handles quick text responses, Lite and Pro process longer text, image, and video inputs for tasks like summarizing charts or analyzing diagrams. Premier offers a custom AI “teacher” feature of large context windows, expanding from 300,000 to over 2 million tokens by 2025.

On the media-creation side, Amazon Nova model Canvas provides users with the ability to manipulate images and insert objects using highly customizable prompts, while Nova Reel creates six-second videos with 360-degree rotation and zoom, among other features. Reel will soon support two-minute videos.

Both Canvas and Reel have a strong focus on responsible use, with watermarking and moderation features to help prevent the spread of harmful content. AWS said that generative AI Nova follows safety precautions around misinformation and other dangers, but didn’t provide many specifics.

Expanding Capabilities Amazon Nova

AWS intends to make a speech-to-speech model available in Q1 2025 and an “any-to-any” model by mid-2025. Such models will change everything from real-time translation to the creation of content.

“You will be able to input text, speech, images, or video and output text, speech, images, or video,” Jassy said, emphasizing how many things are going to change with such advancements.

AWS keeps its training data sources confidential and indemnifies customers against copyright issues, reflecting an industry trend of protecting proprietary methods and intellectual property.

These sets of innovations underscore the exponential possibility of AI chatbot Nova in reinventing industries and user experiences as AWS continues to develop its Nova offerings. Speed, scalability, and ethical deployment form the bedrock for Amazon Nova, a significant leap forward in the evolution of AI technology.


Inside Telecom provides you with an extensive list of content covering all aspects of the tech industry. Keep an eye on our Intelligent Tech sections to stay informed and up-to-date with our daily articles.