Learn about the Latest Advancements in AI and Tech!
Find the Best AI and Tech Tools
Teach you how to use the Best AI and Tech Tools
Weekly Live-Stream where I Teach, Demo, Explore and do Q&A for FREE!
CLICKED CREATIONS AI NEWS: VOL. 3 - NOV 12th 2024
This weeks AI Updates:
Chat GPT Introduces ChatGPT Search: Integrating Live Web Search & Interaction WITH an AI Assistant!
Gemini's New Tool Integrations: Google’s Most Advanced AI Ecosystem.
Claude’s Adds New PDF Image Analysis Tool: Open in Beta plus additional Features.
Microsofts's New Open-Source AI: New Open-Source Multi-Agent Framework & Screen Parsing Tool.
Meta's New Open Source Eco System: adding AI Robotics and Mobile Optimized LLMs.
Prime Video’s X-Ray Recaps: AI-Powered Summaries to Keep You In the Loop.
Runway: Updates Camera Control in Gen-3 Alpha Turbo.
Suno’s New Personas Feature: Now you can Capture, Create, and Share Unique Track Vibes.
Introducing ChatGPT Search: Integrating Live Web Search & Interaction WITH an AI Assistant!
Quick, Accurate Responses: Get immediate answers from the web without needing to jump between search engines.
Reliable Sources Included: ChatGPT links to news articles, sports updates, weather, stock info, and more, helping you dig deeper if you want.
Easy-to-Use Search: Just ask your question naturally, and ChatGPT can pull the latest info from trusted sources.
Simple Follow-Ups: Keep asking follow-up questions in the same conversation, and ChatGPT uses the context for more accurate answers.
Available Now for Plus and Team Users: Try ChatGPT Search at chatgpt.com or add it directly to your Chrome browser with the extension.
Learn More with Linked Sources: Click on the ‘Sources’ button to get direct links to referenced articles, blogs, and data.
New and Improved Design: Find everything from local weather to maps and stock updates in a visually simple, easy-to-navigate format.
Built with Trusted News Partners: ChatGPT collaborates with major publishers like Vox Media, Reuters, and Le Monde for reliable information.
Amanda Caswell Tested Google vs Chat GPT, Check out her Results! Download Chat GPT: https://openai.com/chatgpt/download/ Download Chrome GPT Search Engine: https://chromewebstore.google.com/detail/chatgpt-search/ejcfepkfckglbgocfkanmcdngdijcgld
Gemini's New Tool Integrations: Google’s Most Advanced AI Ecosystem
Three Ways to Experience Gemini
Chat with Gemini
Access Gemini to help with various tasks, from writing to coding. Use the Gemini app or upgrade to Gemini Advanced for Google’s most capable AI models on your phone.
Gemini in Google Products
Gemini is integrated into Google’s suite of tools, enhancing functionality:
Slides: “Help me design” for better presentations.
Sheets: “Help me organize” for efficient data management.
Meet: Auto note-taking feature.
Pixel Recorder: Summarize recordings.
NotebookLM: An AI-powered research assistant in Labs.
Build custom AI apps and agents by integrating Gemini models with Google Cloud services or Google AI Studio.
Recent Updates in Gemini
Gemini 1.5 Flash
Launched with faster response times and expanded access.
Gemini App Update (Oct 2024)
Google’s Gemini is built to work with all the product in its ecosystem making task flows run much more effectively and smoothly and being able to run on everything from mobile devices to powerful data centers, and supports text, images, audio, video, and code is very useful.
What do you think about Gemini?
Claude’s New PDF Image Analysis and Latest Features
Claude can now interpret images in PDFs, including charts and diagrams, making it easier to review complex documents without manually describing visuals.
Great for technical documents, research papers, and any PDFs with crucial visual content.
How to Use PDF Image Analysis in Claude
Login: Sign into Claude.
Attach PDF: Use the paperclip icon to upload your document.
Beta Code: Enter “anthropic-beta: pdfs-2024-09-25” in the prompt box to activate the feature.
Prompt: Ask Claude to explain or review specific sections.
Here is a great example given by Kaycee Hill showing off Claude's PDF abilities.
Other Updates
Desktop App: Claude now has a Mac and Windows app for easier access.
Computer Control: Beta feature lets Claude perform tasks on-screen, like moving the cursor and typing (not available in the app).
Voice Input on Mobile: Android and iOS apps support dictation, letting you ask questions by voice.
Microsofts Releases New Open-Source Multi-Agent Framework & Screen Parsing Tool.
Microsoft's Magentic-One Framework
Overview: Microsoft has introduced Magentic-One, an open-source multi-agent system designed to coordinate specialized AI agents for complex, multi-step tasks.
Functionality: The framework employs an Orchestrator agent to manage agents such as WebSurfer (web navigation), FileSurfer (file management), Coder (code generation), and ComputerTerminal (code execution).
Availability: Magentic-One is accessible to researchers and developers under a custom Microsoft License, facilitating integration into various applications.
Microsoft's OmniParser Tool
Purpose: OmniParser is an open-source model that converts screenshots into structured data, enabling AI agents to better understand and interact with graphical user interfaces (GUIs).
Components: It utilizes models like YOLOv8 for detecting interactive elements, BLIP-2 for understanding their functions, and OCR for text extraction, enhancing AI's capability to navigate and operate within screen-based environments.
Recognition: OmniParser has gained significant attention, becoming a top-trending model on the AI code repository Hugging Face.
Meta's New Open Source Eco System adding AI Robotics and Mobile Optimized LLMs.
Image Source: https://venturebeat.com/ai/meta-unveils-ai-tools-to-give-robots-a-human-touch-in-physical-world/
Llama 3.1 Open Source AI Updates
New Models Released: Llama 3.1 (405B, 70B, 8B) models offer advanced open-source alternatives with improved performance and cost efficiency.
Developer Benefits:
Customization: Fine-tune models for specific data and tasks.
Control & Security: Run models in-house, protecting data without relying on external vendors.
Reduced Costs: Operating costs are about half that of similar closed models like GPT-4o.
Partnerships: Collaborations with Amazon, NVIDIA, and others to expand Llama’s ecosystem across major cloud platforms.
Robotics and Embodied AI Initiatives
Touch Perception & Dexterity:
Sparsh: Vision-based tactile sensing model trained on 460,000 tactile images for better touch perception in robots, enhancing applications like medicine and VR.
Digit 360 & Digit Plexus: Advanced tactile sensors for robots, capturing fine-grained touch data to improve dexterity, available open-source to stimulate community innovation.
Human-Robot Collaboration Benchmark:
PARTNR: A new benchmark for evaluating AI models’ ability to collaborate with humans on complex tasks, using a simulation with 100,000 tasks in virtual environments.
Llama 3.1 models are now available at llama.meta.com.
Prime Video’s X-Ray Recaps: AI-Powered Summaries to Keep You In the Loop
What It Is: X-Ray Recaps uses AI to provide quick, spoiler-free summaries of episodes, seasons, or specific moments in shows, perfect for catching up without rewinding or risking spoilers.
How It Works: Powered by Amazon Bedrock and SageMaker, it analyzes video and subtitles to generate summaries of key events and characters.
Access: Available on Prime Video’s X-Ray feature during playback or on show detail pages, letting you choose recaps for the episode, season, or previous seasons.
Availability: Currently in beta for U.S. Fire TV users, with wider device support by year-end, initially supporting Amazon Originals like The Boys and Daisy Jones & The Six.
Runway: Updates Camera Control in Gen-3 Alpha Turbo
Runway’s New AI Camera Controls: Precision Moves for AI Video Editing
New Camera Controls: Runway’s AI video editor now includes precise camera controls for pans, tracks, and zooms, enhancing creative options for AI video creators.
Features: With these tools, users can move around subjects, arc, and speed-ramp through scenes, adding dynamic angles and motion to their videos.
Availability: The tools are part of Runway Gen-3 Alpha Turbo, available for free or with paid plans starting at $12/month, giving AI artists affordable access to advanced video editing capabilities.
Suno’s New Personas Feature lets you Capture, Create, and Share Unique Track Vibes
What Are Personas?
Personas let you capture a song’s unique vibe—its vocals, style, and energy—and save it as a creative asset to use in new tracks, giving each song its own identity.
How It Works
Choose a song, click “Create” > “Make a Persona.”
Set the Persona to public or private. Public Personas link back to your profile and can inspire others.
Why Use Personas?
Personas preserve the essence of music you love, enabling you to remix and build upon favorite tracks, inviting collaboration and fresh inspiration.
Early Access and Feedback
Currently in beta for Pro and Premier subscribers, with 200 free Persona creations included. Each new Persona after costs 10 credits. Suno is gathering feedback to improve the feature.
If you have not already, Subscribe to our FREE AI Newsletter where you can get FREE Live Trainings, Tutorials, Q&A's and much more, all in the spirit of helping the average Joe take advantage of AI and Technology without spending 10's of thousands of dollars and spinning your wheels. Free AI Newsletter HERE: Also, check out our other socials! SOCIALS Website: http://www.ClickedCreations.com
Youtube Channel: https://www.youtube.com/@SkystruckOnline
Live Streaming Youtube Channel: https://www.youtube.com/@SkystruckLive
Music Youtube Channel: https://www.youtube.com/@citizenskystruck
Instagram: https://www.instagram.com/skystruckonline
Facebook: https://www.facebook.com/SkystruckOnline
X.com (Twitter): https://x.com/SkystruckOnline
Pinterest: https://www.pinterest.com/ClickedCreations
BSKY: https://bsky.app/profile/skystruckonline.bsky.social Substack: https://substack.com/@skystruck
Google Business: https://posts.gle/JcS5gg
Comments