Google Files Adds Gemini Feature for Asking Questions About PDFs
Google's Files app now lets users activate Gemini to ask questions about PDFs, requiring a Gemini Advanced subscription. This feature, introduced at..
OpenAI Event 12: Preview of New Reasoning Models o3 and o3-mini
OpenAI previewed its latest advanced reasoning models, o3 and o3-mini, at the "12 Days of OpenAI" event. These models excel in coding, mathematics,..
Meta AI Launches ExploreToM: New Advancement in AI Theory of Mind Evaluation
ExploreToM, a framework using the A* search algorithm, generates diverse and challenging datasets to evaluate and enhance Theory of Mind (ToM) in..
Patronus AI Launches Glider for Evaluating Large Language Models
Patronus AI launched Glider, a 3.8 billion parameter open-source model for evaluating large language models, outperforming larger tools in accuracy..
Microsoft Introduces Multilingual Real-Time Translation in Windows 11 Preview
Microsoft is previewing a real-time translation feature on Copilot+ PCs with Intel, AMD, and Qualcomm chips, supporting over 44 languages into..
Kuaishou Launches Kling 1.6 Model, Enhancing Video Generation Capabilities
Kuaishou's Kling 1.6 model improves prompt adherence, visual aesthetics, and motion coherence, offering standard and high-quality modes...
Instagram to Launch AI Video Editing Feature for Easy Element Modification
Instagram plans to launch an AI-powered video editing feature next year, allowing users to modify videos with text commands. Based on Meta's Movie..
State Grid Unveils "Bright Power Model" in Beijing
State Grid Corp of China launched the "Bright Power Mega Model," a trillion-parameter AI model for the power industry, excelling in data..
Google Releases Experimental "Reasoning" AI Model Gemini 2.0 Flash Thinking
Google has introduced Gemini 2.0 Flash Thinking Experimental, an AI model focusing on reasoning, available on AI Studio. It excels in multi-modal..
GitHub Launches Free Copilot Subscription for 150 Million Developers
GitHub launched a free Copilot subscription, allowing 150 million developers to use AI features in VS Code, including code generation and chat..
OpenAI Event 11: Enhancing ChatGPT Desktop App with New Integrations
OpenAI announced new integrations for the ChatGPT desktop app, including support for Apple Notes, Notion, Quip, and multiple IDEs. The update..
ByteDance Launches Doubao Visual Understanding Model, Significantly Reducing Application Costs
ByteDance launched the Doubao Visual Understanding Model at the Volcano Engine Force Conference, offering cost-effective multimodal capabilities at..
IBM Open-Sources Granite 3.1 Models, Aiming to Lead in Enterprise LLMs
IBM launched Granite 3.1, an upgraded LLM with 128K token context, new embedding model, and hallucination detection, outperforming competitors in..
NVIDIA Launches New Mini Supercomputer with Enhanced Generative AI Features
NVIDIA's new Jetson Orin Nano Super Developer Kit offers a 70% performance boost, advanced AI capabilities, and is priced at $249. It features an..
Tech Giants Battle in AI, NVIDIA Chip Demand Surges
Tech giants like Microsoft, Meta, and Google are competing in AI. Microsoft is the largest buyer of NVIDIA AI chips, purchasing 485,000 Hopper..
Apple and NVIDIA Collaborate to Enhance Text Generation Performance of Large Language Models
Apple and NVIDIA collaborated to integrate Apple's ReDrafter technology into NVIDIA's TensorRT-LLM, boosting token generation speed by 2.7x and..
Microsoft AI Research Open-Sources PromptWizard: Innovative AI Framework for Optimizing Black-Box LLM Prompts
PromptWizard, an AI framework by Microsoft India Research, optimizes prompts for black-box LLMs using a feedback-driven mechanism. It improves task..
Odyssey Startup Develops AI Tool to Convert Text or Images into 3D Renderings
Odyssey, founded by Oliver Cameron and Jeff Hawke, develops Explorer, an AI tool converting text or images into 3D renderings. Trained with..
Alibaba Launches CosyVoice 2: Enhanced Streaming Speech Synthesis Model
Alibaba's CosyVoice 2 enhances speech synthesis with unified streaming and non-streaming modes, improved pronunciation accuracy, and advanced..
GitHub Launches Free Version of Copilot Coding Tool
GitHub is releasing a free version of its Copilot AI coding tool, integrated into VS Code. It aims to expand access, especially for students and..