Google

企業記事 23 件2026-05-25 〜 2026-05-29

2026年5月

2026-05-29 20:30 JSTITmedia AI+LLM/生成AI画像/動画生成Gemini

「Nano Banana 2」「Nano Banana Pro」が一般提供開始　「2」は動画からの画像生成もサポート

米Googleは5月29日、「Nano Banana 2」（Gemini 3.1 Flash Image）と「Nano Banana Pro」（Gemini 3 Pro Image）の一般提供を開始すると発表した。加えて、動画入力に対応する新機能をNano Banana 2でプ…

2026-05-29 13:00 JSTarXiv cs.AI研究/論文

$\overline{\mathcal M}_{0,n}$ のポアンカール多項式の実数根: AI 支援による証明

安定したドリーニュ-マンフォード法空間 $\overline{\mathcal M}_{0,n}$ のポアンカレ多項式 \[ P_n(t)=\sum_{i=0}^{n-3} \dim H^{2i}(\overline{\mathcal M}_{0,n};\mathbb{Q})t^i \] の実根があることを証明します。 $n$ が指す有理曲線は、アルフィ-チェン-マルコッリの予想を証明します。証明は Keel--Manin--Getzler 再帰から始まりますが、その主な新しいアイデアはポアンカール多項式の二変量変形 $F_m(y,t)$ です。この変形により、1 変数反復では見えない隠れたインターレース構造が明らかになります。固定 $t<0$ の場合、$y$ 方向の $F_m$ のゼロセットは、$0<1-t$ の区間で Sturm--Rolle 引数によって制御されます。元の多項式はスライス $y=1$ 上で復元され、このスライスを通る移動根の順序付けされた交差により、実根性と厳密なインターレースの両方が得られます。その結果、$\overline{\mathcal M}_{0,n}$ の Betti 数は超対数凹列を形成します。さらに、複素射影線の縮退における $n$ 順序点のフルトン-マクファーソン空間 $\mathbb{P}^1[n]$ のポアンカレ多項式の実根性と超対数凹面を証明します。 $\overline{\mathcal M}_{0,n}$ の証明は、Google DeepMind が開発したエージェントフロンティアモデルシステムである Co-Mathematician による AI 支援ワークフローの反復を通じて得られました。人間の役割は、問題を提起し、連続する試みを評価し、ギャップの修復を要求し、進化する議論を文献と比較し、人間が検証可能な最終的な証拠を組み立てることでした。私たちの追加の人的貢献は、同様の残留変形戦略がフルトン-マクファーソン空間 $\mathbb P^1[n]$ に適用され、対応する実根定理が得られることを観察することでした。

原文 (English)

Real-rootedness of the Poincar\'e polynomials of $\overline{\mathcal M}_{0,n}$: an AI-assisted proof

We prove real-rootedness for the Poincar\'e polynomial \[ P_n(t)=\sum_{i=0}^{n-3} \dim H^{2i}(\overline{\mathcal M}_{0,n};\mathbb{Q})t^i \] of the Deligne--Mumford moduli space $\overline{\mathcal M}_{0,n}$ of stable $n$-pointed rational curves, proving a conjecture of Aluffi--Chen--Marcolli. The proof starts from the Keel--Manin--Getzler recurrence, but its main new idea is a bivariate deformation $F_m(y,t)$ of the Poincar\'e polynomial. This deformation reveals a hidden interlacing structure not visible in the one-variable recurrence. For fixed $t<0$, the zero set of $F_m$ in the $y$-direction is controlled by a Sturm--Rolle argument on the interval $0<1-t$. The original polynomial is recovered on the slice $y=1$, and the ordered crossings of the moving roots through this slice give both real-rootedness and strict interlacing. Consequently, the Betti numbers of $\overline{\mathcal M}_{0,n}$ form an ultra-log-concave sequence. We further prove real-rootedness and ultra-log-concavity for the Poincar\'e polynomial of the Fulton--MacPherson space $\mathbb{P}^1[n]$ of $n$ ordered points in degenerations of the complex projective line. The proof for $\overline{\mathcal M}_{0,n}$ was obtained through an iterative AI-assisted workflow with Co-Mathematician, an agentic frontier-model system developed by Google DeepMind. The human role was to pose the problem, evaluate successive attempts, request repairs of gaps, compare the evolving argument with the literature, and assemble the final human-verifiable proof. Our additional human contribution was to observe that a similar residual deformation strategy applies to the Fulton--MacPherson spaces $\mathbb P^1[n]$, yielding the corresponding real-rootedness theorem.

2026-05-29 13:00 JSTarXiv cs.AILLM/生成AIエージェントOpenAI Alibaba

Does The Way You Plan Matter? An Empirical Study of Planning Representations for LLM Web Agents

Despite recent advances, LLM-based web agents still struggle with limited exploration, omission of critical steps, and sensitivity to task…

2026-05-29 05:00 JSTITmedia AI+LLM/生成AI

「Google Antigravity 2.0と戯れながら感じたこと」と「LLM Wikiを実践して『ロケスマペディア』を作ってみた」

かわさきからは「Google Antigravity 2.0と戯れながら感じたこと」というタイトルで生成AI時代における教科書的コンテンツの存在意義と、AIにコードを書かせる時代の学び方について、一色からは「LLM Wikiを実践して『ロケスマペディア』を作ってみた」というタイ…

2026-05-28 13:00 JSTarXiv cs.AIエージェント

Do Agents Need Semantic Metadata? A Comparative Study in Agentic Data Retrieval

In the era of autonomous agents, machine-actionable data is critical for data-driven workflows. For more than a decade, semantic metadata l…

2026-05-28 09:17 JSTTechCrunch AIその他

Why Google’s AI can’t spell Google (or anything else)

Google is embarrassing itself, again.

2026-05-28 03:39 JSTTechCrunch AIその他

Your SEO strategy is optimized for a search engine that no longer exists.

Google I/O made it official: AI-generated answers are now front and center in search, and most brands have almost no visibility into how AI…

2026-05-27 13:00 JSTarXiv cs.AIエージェント研究/論文

科学のための Agentic AI の実験

この論文では、科学ワークフローにおける自律型エージェント AI を開発するための 2 つの新しいフレームワークについて詳しく説明します。どちらのシステムも、Google Colab を介したハイブリッド Local Body、Remote Brain アーキテクチャを活用し、Python ベースのローカルオーケストレーターを利用して大規模言語モデル (LLM) クラウドバックエンドを呼び出します。最初のエージェントである DeepTS/DeepCollector は、時系列データセットの大規模なキュレーション、抽出、重複排除を自動化します。 2 つ目の DeepScribe は、視覚的に緻密で数学的に複雑な物理学の講義を構造化された科学レポートに変換する自律型プレゼンテーションアナライザーです。粒度の高い属性抽出 (Cellular RAG)、リモートデータ検査、分散同時実行制御などの実践的なシステムエンジニアリングを通じて、エージェント AI が現在の最先端システムのコンテキストと推論の制限をどのように克服して、科学的なワークフローを厳密にサポートできるかを実証します。最後に、深い知識グラフをサポートする DeepTS の一般化について概説し、この概念的なアプローチの高エネルギー物理学 (DeepQCD) への応用について説明します。

原文 (English)

Experiments in Agentic AI for Science

This paper details two novel frameworks for developing autonomous, agentic AI in scientific workflows. Both systems leverage a hybrid Local Body, Remote Brain architecture via Google Colab, utilizing Python-based local orchestrators to invoke large language model (LLM) cloud backends. The first agent, DeepTS/DeepCollector, automates the large-scale curation, extraction, and deduplication of time-series datasets. The second, DeepScribe, is an autonomous presentation analyzer that converts visually dense, mathematically complex physics lectures into structured scientific reports. Through practical systems engineering-such as granular attribute extraction (Cellular RAG), remote data inspection, and distributed concurrency controls-we demonstrate how agentic AI can overcome the context and reasoning limitations of current state-of-the-art systems to rigorously support scientific workflows. Finally, we outline a generalization of DeepTS to support deep knowledge graphs and discuss the application of this conceptual approach to high-energy physics (DeepQCD).

2026-05-27 07:32 JSTTechCrunch AIエージェント

DuckDuckGo installs are up 30% as users reject being ‘force-fed’ Google’s AI Search

Google overhauled Search at I/O 2026, replacing blue links with AI agents. The backlash has been swift. DuckDuckGo app installs spiked 30%…

2026-05-26 13:00 JSTITmedia AI+LLM/生成AIGemini

Gemini APIが“半額”で使える「Flex」　注意点は？　「Priority」とは何が違う？

Googleは「Gemini」のAPI向けに、新たなサービスティア「Flex」と「Priority」を追加した。Flexは標準サービスティアの半額で利用できるという。両者はどう違い、どう使い分けるべきなのか。

2026-05-26 13:00 JSTarXiv cs.AILLM/生成AIAnthropic Claude

Claude AI Health 引用における典拠シグナル: 典拠シグナルフレームワークを使用した記述分析

この研究は、消費者の健康に関する質問に答える際に、Anthropic の Claude AI が情報源を提示する際に使用する権威シグナルを特定することを目的としています。 LLM が作成する健康関連の引用の質については多くの議論が存在しますが、引用元の情報源の完全性、およびその情報源が医療専門家がどの範囲まで信頼できる情報源であると考えるかについての情報は限られています。この記述的な横断研究では、Google Research が厳選した 3,172 件の消費者の健康に関する質問を含む HealthSearchQA のデータを使用しました。除外後、10,038 件の引用をもたらした 3,075 の質問からなる最終データセットが分析されました。 Authority Signals Framework (Jacques et al., 2026) を適用して、542 ソースの不均衡な層別サンプルについて 4 つのドメインにわたる 10 個の権威シグナルを調査しました。確立された機関情報源がすべての引用文献の 97.8% を占めました (n = 9,818)。組織の種類として最も多く挙げられたのは医療機関 (36.5%) で、続いて政府リソース (31.6%)、専門家協会 (28.4%) でした。商用健康情報は 2.2% (n = 220) を占めました。上位 10 の組織がすべての引用の 57.8% を占め、メイヨークリニックだけで 24.7% を占めました。焦点を当てたサンプルに含まれる商業情報源のうち、86.4% が医学的レビューの記述を表示し、82.5% がスキーママークアップを使用し、71.8% が包括的な内容を持っていた一方、伝統的な機関情報源は、これらの同じマーカーの有無にかかわらずクロードの引用に現れていました。 Anthropic はクロードを HIPAA 対応のヘルスケアアプリケーションとして位置づけており、これらの発見はクロードの引用行動のベースラインを確立し、AI を介した健康情報の継続的なクロスプラットフォーム評価のためのツールとしての Authority Signals Framework の有用性を実証しています。

原文 (English)

Authority Signals in Claude AI Health Citations: A Descriptive Analysis Using the Authority Signals Framework

This study seeks to determine the authority signals used by Anthropic's Claude AI in its presentation of sources when answering consumer health questions. While there exists a great deal of discourse around the quality of health citations that LLMs produce, there is limited information on the integrity of the sources the citations originate from, and to what extent the sources are, from what health professionals would consider, credible sources. This descriptive cross-sectional study used data from HealthSearchQA, which contains 3,172 consumer health questions curated by Google Research. After exclusions, a final dataset of 3,075 questions yielding 10,038 citations was analyzed. The Authority Signals Framework (Jacques et al., 2026) was applied to examine 10 authority signals across four domains for a disproportionate stratified sample of 542 sources. Established institutional sources accounted for 97.8% of all citations (n = 9,818). Medical Institutions were the most frequently cited organization type (36.5%), followed by Government Resources (31.6%) and Professional Associations (28.4%). Commercial Health Information comprised 2.2% (n = 220). The top 10 organizations accounted for 57.8% of all citations, with Mayo Clinic alone representing 24.7%. Among commercial sources in the focused sample, 86.4% displayed medical review statements, 82.5% used schema markup, and 71.8% had comprehensive content, while traditional institutional sources appeared in Claude's citations with or without these same markers. As Anthropic positions Claude for HIPAA-ready healthcare applications, these findings establish a baseline for Claude's citation behavior and demonstrate the utility of the Authority Signals Framework as a tool for ongoing, cross-platform evaluation of AI-mediated health information.

2026-05-26 13:00 JSTarXiv cs.AILLM/生成AIエージェント

Attested Tool-Server Admission: A Security Extension to the Model Context Protocol

The Model Context Protocol (MCP) standardizes how a large-language-model (LLM) agent and an external tool server exchange messages, but not…

2026-05-26 13:00 JSTarXiv cs.AILLM/生成AIOpenAI Gemma Llama

AI Content Moderation in Therapy Conversations

Large language models (LLMs) are increasingly being used for emotional support. They are also being developed for formal therapy purposes.…

2026-05-26 13:00 JSTarXiv cs.AIハードウェア/半導体Gemma

Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

We present the first end-to-end demonstration of fine-tuning and serving Google's Gemma 4 31B model on TPU hardware, providing an empirical…

2026-05-26 13:00 JSTarXiv cs.AILLM/生成AIエージェントAnthropic

A Two-Dimensional Framework for AI Agent Design Patterns: Cognitive Function and Execution Topology

Existing frameworks for LLM-based agent architectures describe systems from a single perspective: industry guides (Anthropic, Google, LangC…

2026-05-26 13:00 JSTarXiv cs.AILLM/生成AIハードウェア/半導体Anthropic Claude OpenAI GPT / ChatGPT

AMEL: Accumulated Message Effects on LLM Judgments

Large language models are routinely used as automated evaluators: to review code, moderate content, or score outputs, often with many items…

2026-05-26 13:00 JSTarXiv cs.AI研究/論文

AI-generated podcasts: Synthetic Intimacy and Cultural Mistranslation in NotebookLM's Audio Overviews

This paper analyses AI-generated podcasts produced by Google's NotebookLM, which generates audio podcasts with two chatty AI hosts discussi…

2026-05-26 13:00 JSTarXiv cs.AI研究/論文Anthropic DeepSeek

The Growing Pains of Frontier Models: When Leaderboards Stop Separating and What to Measure Next

Leaderboards rank frontier models on independent axes but do not reveal whether capabilities reinforce or trade off across releases -- and…

2026-05-25 19:38 JSTITmedia AI+その他

日大、教職員1万人が「Google AI Pro」活用へ

日本大学が、米Googleの教育機関向けAIサブスクリプション「Google AI Pro for Education」を導入する。専任の教職員1万人が利用可能にし、定型業務のさらなる効率化を目指す。グーグル・クラウド・ジャパンが発表した。

2026-05-25 13:00 JSTarXiv cs.AILLM/生成AIClaude

MadEvolve: 大規模な言語モデルを使用したトレーディングシステムの進化的な最適化

私たちは、定量的金融におけるいくつかの一般的なタスクへの LLM 主導のアルゴリズム最適化の適用を検討します。 DeepMind の Alpha-Evolve からインスピレーションを得た汎用アルゴリズム最適化フレームワークである MadEvolve は、計算宇宙論におけるアルゴリズムを最適化するために最近開発されました。ここでは、ビットコイン取引の例で、アルゴリズム取引戦略とアルファ生成を最適化するための MadEvolve の有用性を示します。シミュレーションとバックテストのセットアップでは、シグナル生成のための機能セットの進化、取引戦略の個別コンポーネントの最適化、実行戦略と機能パイプラインの共同進化など、検討したすべてのタスクで大幅な改善を達成しました。さらに、私たちの方法を他のエージェント検索アプローチ、特にクロードコードと比較し、シミュレーション設定での p-ハッキングの確率を慎重に評価します。私たちの調査結果は、アルゴリズム取引と定量的金融における AI 主導のエージェント的アルゴリズムと進化的アルゴリズムの有用性を強く裏付けています。

原文 (English)

MadEvolve: Evolutionary Optimization of Trading Systems with Large Language Models

We explore the application of LLM-driven algorithm optimization to several common tasks in quantitative finance. MadEvolve, a general-purpose algorithm optimization framework inspired by DeepMind's Alpha-Evolve, was recently developed to optimize algorithms in computational cosmology. Here we demonstrate the utility of MadEvolve to optimize algorithmic trading strategies and alpha generation at the example of Bitcoin trading. On our simulation and backtesting setup, we achieve significant improvements on all tasks we considered, such as evolving feature sets for signal generation, optimizing separate components of the trading strategy, and jointly evolving the feature pipeline together with the execution strategy. Additionally, we compare our method to other agentic search approaches, specifically Claude Code, and carefully evaluate p-hacking probabilities on our simulation setup. Our findings strongly support the utility of AI-driven agentic and evolutionary algorithms for algorithmic trading and quantitative finance.

2026-05-25 13:00 JSTarXiv cs.AI研究/論文

勾配ペナルティ付き潜在ダイナミクスを使用してスムーズに夢を見、効率的にサンプリングする

モデルベースの強化学習は、ワールドモデルを学習することでサンプル効率を向上させます。ただし、DreamerV3 などの既存の潜在世界モデルは、学習された遷移ダイナミクスに局所的な滑らかさを明示的に適用しないため、遷移ダイナミクス学習の有用な帰納的バイアスが活用されないままになります。我々は、DreamerV3 用の勾配ペナルティ付き潜在ダイナミクス正則化装置である GPLD を提案します。これは、行単位のヤコビペナルティを事後潜在分布に適用して、局所的に滑らかな遷移学習を促進します。我々は、このペナルティが、離散組み込み状態 MDP における遷移則の有限差分平滑化の連続潜在類似物として解釈できることを示し、ハッチンソン型の確率的プローブを使用して効率的に推定できることを示します。経験的には、DeepMind Control の固有受容タスク全体で、GPLD は総サンプル効率を向上させ、より複雑な移動環境で特に大きな効果をもたらします。より困難な四足歩行タスクでは、GPLD は早期に高収益の動作に達し、長期にわたってより一貫した後期学習を示します。明示的な局所平滑性正則化は、潜在世界モデルを改善して滑らかな連続制御環境を実現する簡単かつ効果的な方法です。 GPLD のコードは github.com/romils9/gpld-mbrl で入手できます。

原文 (English)

Dreaming Smoothly and Sample Efficiently with Gradient Penalized Latent Dynamics

Model-based reinforcement learning improves sample efficiency by learning a world model. However, existing latent world models such as DreamerV3 do not explicitly enforce local smoothness in their learned transition dynamics, leaving a useful inductive bias for transition dynamics learning unexploited. We propose GPLD, a gradient-penalized latent dynamics regularizer for DreamerV3 that applies a row-wise Jacobian penalty to the posterior latent distribution to encourage locally smooth transition learning. We show that this penalty can be interpreted as the continuous-latent analog of finite-difference smoothing of transition laws in discrete embedded-state MDPs, and estimate it efficiently using Hutchinson-style stochastic probes. Empirically, across DeepMind Control proprioceptive tasks, GPLD improves aggregate sample efficiency, with particularly strong gains on higher-complexity locomotion environments. On more challenging quadruped tasks, GPLD reaches high-return behavior earlier and exhibits more consistent late-stage learning over longer horizons. Explicit local smoothness regularization is a simple and effective way to improve latent world models for smooth continuous control environments. Code for GPLD is available at github.com/romils9/gpld-mbrl .

2026-05-25 13:00 JSTarXiv cs.AI研究/論文OpenAI

Reflex: 状態ベースの連続制御における反射対称性の活用による強化学習

強化学習は長い間、サンプル効率の悪さに悩まされてきました。この問題を軽減するための有望なアプローチの 1 つは、グループ不変マルコフ決定プロセス ($G$ 不変 MDP) を活用することです。この方向の既存の研究は、主に $\mathrm{SO(2)}$ などの画像ベースの RL と回転対称性に焦点を当てており、状態ベースの RL と鏡映対称性はほとんど検討されていません。この研究では、状態ベースの連続制御タスクに焦点を当て、オンポリシーとオフポリシーの両方の RL アルゴリズムとシームレスに統合するパラダイムである Reflex を導入することで鏡映対称性を活用します。軸反射と両側反射という 2 つのタイプの反射を形式化し、それらの対応する変換を特徴付けます。 Reflex は、対称性を維持する最適値関数とポリシーの理論的分析に基づいて、原則に基づいた対称性正則化メカニズムを通じて鏡映対称性をポリシー学習に統合します。 Reflex を PPO および SAC と統合し、一連の OpenAI Gym および DeepMind Control ベンチマークで評価し、サンプル効率を向上させながら標準ベースラインを上回る優れたパフォーマンスを実証しました。私たちのコードは https://github.com/TonyStark042/Reflex で入手できます。

原文 (English)

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

Reinforcement learning has long struggled with poor sample efficiency. One promising approach to mitigate this problem is leveraging group-invariant Markov Decision Processes ($G$-invariant MDPs). Existing works in this direction have primarily focused on image-based RL and rotational symmetry such as $\mathrm{SO(2)}$, leaving state-based RL and reflection symmetry largely underexplored. In this work, we focus on state-based continuous control tasks and exploit reflection symmetry by introducing Reflex, a paradigm that seamlessly integrates with both on-policy and off-policy RL algorithms. We formalize two types of reflection-axial reflection and bilateral reflection, and characterize their corresponding transformations. Building on a theoretical analysis of symmetry-preserving optimal value functions and policies, Reflex integrates reflection symmetry into policy learning through principled symmetry regularization mechanisms. We integrate Reflex with PPO and SAC, and evaluate it on a suite of OpenAI Gym and DeepMind Control benchmarks, demonstrating superior performance over standard baselines while improving sample efficiency. Our code is available at https://github.com/TonyStark042/Reflex.

2026-05-25 06:39 JSTTechCrunch AIその他

Everyone is navigating AI security in real time — even Google

We're in the transition period -- all of us.