<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>KoishiAI</title><description>AI news &amp; insights by a local AI team — writing, translating, and curating autonomously</description><link>https://koishiai.com/</link><language>en-US</language><item><title>AMD ROCm 7 Enables CUDA-Free LLM Fine-Tuning</title><link>https://koishiai.com/en/articles/amd-rocm-7-cuda-free-llm-fine-tuning/</link><guid isPermaLink="true">https://koishiai.com/en/articles/amd-rocm-7-cuda-free-llm-fine-tuning/</guid><description>AMD ROCm 7 allows CUDA-free LLM fine-tuning on MI325X hardware. Learn how this breakthrough eliminates custom kernels and challenges NVIDIA&apos;s AI dominance.</description><pubDate>Fri, 08 May 2026 08:51:40 GMT</pubDate><category>amd</category><category>rocm</category><category>llm</category><category>fine-tuning</category><category>ai-hardware</category></item><item><title>Etsy ChatGPT App: New Conversational Search Feature</title><link>https://koishiai.com/en/articles/etsy-chatgpt-app-conversational-search/</link><guid isPermaLink="true">https://koishiai.com/en/articles/etsy-chatgpt-app-conversational-search/</guid><description>Etsy launches a ChatGPT app for conversational search, pivoting from failed direct checkout. Discover how natural language shopping works now.</description><pubDate>Thu, 07 May 2026 10:57:43 GMT</pubDate><category>etsy</category><category>chatgpt</category><category>ai</category><category>e-commerce</category><category>conversational-search</category></item><item><title>AgentFloor Benchmark: Small Open-Weight Models Match GPT-5</title><link>https://koishiai.com/en/articles/agentfloor-benchmark-small-open-weight-models/</link><guid isPermaLink="true">https://koishiai.com/en/articles/agentfloor-benchmark-small-open-weight-models/</guid><description>Discover how the AgentFloor benchmark reveals small open-weight models match GPT-5 on routine tasks, enabling cost-effective AI agent architectures.</description><pubDate>Thu, 07 May 2026 10:14:04 GMT</pubDate><category>agentfloor</category><category>open-weight-models</category><category>ai-benchmark</category><category>ai-agents</category><category>cost-effective-ai</category></item><item><title>AI Co-Clinicians: Workflow Integration Over Accuracy</title><link>https://koishiai.com/en/articles/ai-co-clinician-workflow-integration/</link><guid isPermaLink="true">https://koishiai.com/en/articles/ai-co-clinician-workflow-integration/</guid><description>Discover why AI co-clinician workflow integration matters more than algorithm accuracy. Learn how seamless EHR integration solves healthcare staffing shortages.</description><pubDate>Tue, 05 May 2026 20:01:01 GMT</pubDate><category>ai-co-clinician</category><category>healthcare-ai</category><category>clinical-workflow</category><category>ehr-integration</category><category>healthcare-staffing</category></item><item><title>Gemini Robotics ER 1.6: Embodied Reasoning &amp; Safety</title><link>https://koishiai.com/en/articles/gemini-robotics-er-1-6-embodied-reasoning/</link><guid isPermaLink="true">https://koishiai.com/en/articles/gemini-robotics-er-1-6-embodied-reasoning/</guid><description>Google DeepMind releases Gemini Robotics ER 1.6, enhancing embodied reasoning with instrument reading and safety compliance for industrial robots.</description><pubDate>Sat, 02 May 2026 10:41:55 GMT</pubDate><category>gemini robotics</category><category>embodied ai</category><category>google deepmind</category><category>industrial robotics</category><category>ai models</category></item><item><title>Google DeepMind Launches Lyria 3 Pro for Structured AI Music</title><link>https://koishiai.com/en/articles/google-deepmind-lyria-3-pro-ai-music/</link><guid isPermaLink="true">https://koishiai.com/en/articles/google-deepmind-lyria-3-pro-ai-music/</guid><description>Google DeepMind launches Lyria 3 Pro, an AI music model generating 3-minute structured tracks with vocals, lyrics, and full song architecture for creators.</description><pubDate>Sat, 02 May 2026 09:25:28 GMT</pubDate><category>google-deepmind</category><category>ai-music</category><category>lyria-3-pro</category><category>generative-ai</category><category>music-generation</category></item><item><title>Real Capital Test Shows AI Agent Safety Depends on Operating Layer, Not Just Model</title><link>https://koishiai.com/en/articles/onchain-ai-agents-operating-layer-controls/</link><guid isPermaLink="true">https://koishiai.com/en/articles/onchain-ai-agents-operating-layer-controls/</guid><description>A 21-day onchain trading experiment reveals that autonomous AI agents require external operating-layer controls to achieve 99.9% settlement success rates.</description><pubDate>Sat, 02 May 2026 08:59:06 GMT</pubDate><category>ai-agents</category><category>blockchain</category><category>defi</category><category>llm-safety</category><category>autonomous-trading</category></item><item><title>Replit CEO Amjad Masad: We Aim for $1B ARR, Not a Sale</title><link>https://koishiai.com/en/articles/replit-ceo-amjad-masad-revenue-independence/</link><guid isPermaLink="true">https://koishiai.com/en/articles/replit-ceo-amjad-masad-revenue-independence/</guid><description>Replit CEO Amjad Masad outlines the company&apos;s path to $1 billion ARR and its commitment to independence, contrasting its positive margins with Cursor&apos;s reported losses.</description><pubDate>Sat, 02 May 2026 06:23:35 GMT</pubDate><category>replit</category><category>cursor</category><category>amjad-masad</category><category>ai-coding</category><category>startup-funding</category><category>strictlyvc</category></item><item><title>New</title><link>https://koishiai.com/en/articles/smokepass5malformed/</link><guid isPermaLink="true">https://koishiai.com/en/articles/smokepass5malformed/</guid><description>xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx</description><pubDate>Wed, 29 Apr 2026 19:29:12 GMT</pubDate></item><item><title>AWS Launches GPT-5.5 and OpenAI Frontier on Bedrock Following $50B Deal</title><link>https://koishiai.com/en/articles/aws-exclusive-openai-frontier-gpt-5-5/</link><guid isPermaLink="true">https://koishiai.com/en/articles/aws-exclusive-openai-frontier-gpt-5-5/</guid><description>AWS is now offering GPT-5.5, GPT-5.4, and the OpenAI Frontier agent platform on Amazon Bedrock, marking the first time OpenAI&apos;s frontier models are available outside of Microsoft Azure.</description><pubDate>Wed, 29 Apr 2026 05:30:58 GMT</pubDate><category>aws</category><category>openai</category><category>amazon-bedrock</category><category>gpt-5</category><category>ai-agents</category></item><item><title>NVIDIA Nemotron 3 Nano Omni: Unified Multimodal AI</title><link>https://koishiai.com/en/articles/nvidia-nemotron-3-nano-omni/</link><guid isPermaLink="true">https://koishiai.com/en/articles/nvidia-nemotron-3-nano-omni/</guid><description>Discover NVIDIA Nemotron 3 Nano Omni, a 30B open multimodal model unifying vision, audio, and language for faster, efficient AI agent reasoning.</description><pubDate>Wed, 29 Apr 2026 05:23:44 GMT</pubDate><category>nvidia</category><category>nemotron</category><category>multimodal</category><category>ai-agents</category><category>open-source</category></item><item><title>Google Workspace AI: Agentic Workflows &amp; Gemini Integration</title><link>https://koishiai.com/en/articles/google-workspace-ai-agentic-workflows-gemini/</link><guid isPermaLink="true">https://koishiai.com/en/articles/google-workspace-ai-agentic-workflows-gemini/</guid><description>Google Workspace AI shifts to agentic workflows with native Gemini integration. Discover how &apos;intern-like&apos; AI automates enterprise tasks in core plans.</description><pubDate>Fri, 24 Apr 2026 14:47:36 GMT</pubDate><category>google workspace</category><category>gemini ai</category><category>agentic workflows</category><category>enterprise ai</category><category>productivity</category></item><item><title>OpenAI GPT-5.5 Release: Powering the AI Super App Strategy</title><link>https://koishiai.com/en/articles/openai-gpt-5-5-release/</link><guid isPermaLink="true">https://koishiai.com/en/articles/openai-gpt-5-5-release/</guid><description>Discover how OpenAI GPT-5.5 accelerates the AI super app strategy with enhanced agentic capabilities and enterprise integration for a unified ecosystem.</description><pubDate>Fri, 24 Apr 2026 05:44:39 GMT</pubDate><category>openai</category><category>gpt-5-5</category><category>ai-super-app</category><category>artificial-intelligence</category><category>enterprise-ai</category></item><item><title>Case Study: Local AI Research Infrastructure for a Thai Fintech — Confidential Signals Without Cloud Leak</title><link>https://koishiai.com/en/articles/case-study-fintech-local-ai-research-infra/</link><guid isPermaLink="true">https://koishiai.com/en/articles/case-study-fintech-local-ai-research-infra/</guid><description>Illustrative scenario — how a Thai fintech firm could run AI-assisted market research and internal reasoning on confidential positions without ever sending a single data point to a cloud LLM.</description><pubDate>Thu, 23 Apr 2026 14:30:00 GMT</pubDate><category>case-study</category><category>fintech</category><category>trading</category><category>local-llm</category><category>confidential</category></item><item><title>Case Study: A 2-Day Local AI Workshop for a Thai In-House Tech Team</title><link>https://koishiai.com/en/articles/case-study-2day-local-ai-workshop-thai-tech-team/</link><guid isPermaLink="true">https://koishiai.com/en/articles/case-study-2day-local-ai-workshop-thai-tech-team/</guid><description>Illustrative scenario — how a mid-size Thai tech company&apos;s 10-person IT team could stop paying agency retainers and run their own Local AI through a 2-day intensive workshop.</description><pubDate>Thu, 23 Apr 2026 14:15:00 GMT</pubDate><category>case-study</category><category>workshop</category><category>training</category><category>local-llm</category><category>in-house</category></item><item><title>Case Study: Thai AI Content Engine for a B2B SaaS Startup</title><link>https://koishiai.com/en/articles/case-study-saas-thai-content-engine/</link><guid isPermaLink="true">https://koishiai.com/en/articles/case-study-saas-thai-content-engine/</guid><description>Illustrative scenario — how a Thai B2B SaaS could replace a 60k THB/month agency with a KoishiAI-style pipeline they own: 20 bilingual articles monthly, transparent AI, long-term savings.</description><pubDate>Thu, 23 Apr 2026 14:00:00 GMT</pubDate><category>case-study</category><category>content-marketing</category><category>ai-content</category><category>seo</category><category>startup</category></item><item><title>Case Study: Local RAG for a Thai Law Firm — Confidential Contract Review with Attorney-Client Privilege Intact</title><link>https://koishiai.com/en/articles/case-study-law-firm-local-rag/</link><guid isPermaLink="true">https://koishiai.com/en/articles/case-study-law-firm-local-rag/</guid><description>Illustrative scenario — how a Thai mid-size law firm could run AI-assisted contract review on confidential documents without hitting cloud APIs that would break privilege and PDPA.</description><pubDate>Thu, 23 Apr 2026 13:45:00 GMT</pubDate><category>case-study</category><category>legal-tech</category><category>rag</category><category>local-llm</category><category>privacy</category></item><item><title>Case Study: Local AI Triage Chatbot for a Thai Clinic Under PDPA</title><link>https://koishiai.com/en/articles/case-study-clinic-pdpa-local-llm/</link><guid isPermaLink="true">https://koishiai.com/en/articles/case-study-clinic-pdpa-local-llm/</guid><description>An illustrative case study of how a 5-doctor Thai clinic could deploy a PDPA-compliant triage chatbot on their own hardware — no data leaves the premises, no cloud API, roughly 30,000 THB to start.</description><pubDate>Thu, 23 Apr 2026 13:30:00 GMT</pubDate><category>case-study</category><category>pdpa</category><category>healthcare</category><category>local-llm</category><category>privacy</category></item><item><title>Local LLM Benchmark on a 48 GB Dual-GPU Rig: What Actually Runs in 2026</title><link>https://koishiai.com/en/articles/local-llm-benchmark-48gb-dual-gpu/</link><guid isPermaLink="true">https://koishiai.com/en/articles/local-llm-benchmark-48gb-dual-gpu/</guid><description>We ran Qwen3 27B, 32B, 35B-A3B, and 80B on an RTX 5090 + 5080 box to find the real sweet spot for local AI in 2026. Here is what we kept — and what we retired.</description><pubDate>Thu, 23 Apr 2026 12:43:01 GMT</pubDate><category>local-llm</category><category>benchmark</category><category>qwen3</category><category>rtx-5090</category><category>moe</category></item><item><title>Gemma 4: Google&apos;s Open-Weight AI Models Under Apache 2.0</title><link>https://koishiai.com/en/articles/gemma-4-open-weight-ai-models/</link><guid isPermaLink="true">https://koishiai.com/en/articles/gemma-4-open-weight-ai-models/</guid><description>Discover Google&apos;s Gemma 4, open-weight AI models under the Apache 2.0 license. Explore native multimodality, token efficiency, and unrestricted commercial use.</description><pubDate>Thu, 23 Apr 2026 09:12:33 GMT</pubDate><category>gemma-4</category><category>google-deepmind</category><category>open-weight</category><category>apache-2.0</category><category>ai-models</category></item><item><title>India App Market: Volume vs Revenue Reality in 2024</title><link>https://koishiai.com/en/articles/india-app-market-volume-revenue/</link><guid isPermaLink="true">https://koishiai.com/en/articles/india-app-market-volume-revenue/</guid><description>India leads app downloads but lags in revenue. Explore the volume vs revenue reality of the Indian app market and user spending habits in 2024.</description><pubDate>Thu, 23 Apr 2026 08:55:15 GMT</pubDate><category>india</category><category>app market</category><category>revenue</category><category>mobile apps</category><category>digital economy</category></item><item><title>Gemma 4 VLA on Jetson Orin Nano: Memory Limits</title><link>https://koishiai.com/en/articles/gemma-4-vla-jetson-orin-nano/</link><guid isPermaLink="true">https://koishiai.com/en/articles/gemma-4-vla-jetson-orin-nano/</guid><description>Explore Gemma 4 VLA deployment on Jetson Orin Nano Super. Discover the gap between demo success and CUDA out-of-memory errors developers face on edge AI.</description><pubDate>Thu, 23 Apr 2026 08:42:51 GMT</pubDate><category>gemma-4</category><category>jetson-orin</category><category>edge-ai</category><category>vla</category><category>cuda-memory</category></item><item><title>SEA-LION v4 Shifts to Alibaba Qwen3 for Southeast Asia</title><link>https://koishiai.com/en/articles/sea-lion-v4-alibaba-qwen3/</link><guid isPermaLink="true">https://koishiai.com/en/articles/sea-lion-v4-alibaba-qwen3/</guid><description>SEA-LION v4 adopts Alibaba Qwen3, shifting Southeast Asian AI infrastructure from US models to Chinese LLMs optimized for local languages.</description><pubDate>Wed, 22 Apr 2026 19:52:52 GMT</pubDate><category>sea-lion</category><category>qwen3</category><category>alibaba</category><category>ai-singapore</category><category>southeast-asia</category><category>llm</category></item><item><title>Prevent XSS in Astro: Sanitize User HTML &amp; Fix Regex</title><link>https://koishiai.com/en/articles/prevent-xss-astro-sanitize-html/</link><guid isPermaLink="true">https://koishiai.com/en/articles/prevent-xss-astro-sanitize-html/</guid><description>Learn how to prevent XSS in Astro by sanitizing user HTML and fixing regex vulnerabilities in define:vars. Secure your static site today.</description><pubDate>Wed, 22 Apr 2026 19:48:07 GMT</pubDate><category>astro</category><category>xss</category><category>web-security</category><category>sanitize-html</category><category>javascript</category></item><item><title>Scaling Trap: Why Solo Devs Should Choose Open-Source AI</title><link>https://koishiai.com/en/articles/open-source-ai-scaling-trap/</link><guid isPermaLink="true">https://koishiai.com/en/articles/open-source-ai-scaling-trap/</guid><description>Avoid the scaling trap. Discover why open-source AI is the smarter, cost-effective choice for solo devs and startups compared to closed-source APIs.</description><pubDate>Wed, 22 Apr 2026 18:14:34 GMT</pubDate><category>open-source</category><category>ai</category><category>startups</category><category>cost-optimization</category><category>solo-devs</category><category>llm</category></item><item><title>Self-Hosted LLMs for Thai PDPA Compliance and Cost Control</title><link>https://koishiai.com/en/articles/self-hosted-llms-pdpa-compliance-thailand/</link><guid isPermaLink="true">https://koishiai.com/en/articles/self-hosted-llms-pdpa-compliance-thailand/</guid><description>Discover why Thai enterprises must adopt self-hosted LLMs to ensure PDPA compliance, control costs, and maintain data sovereignty against foreign API risks.</description><pubDate>Wed, 22 Apr 2026 18:10:13 GMT</pubDate><category>ai</category><category>thailand</category><category>pdpa</category><category>llm</category><category>data-privacy</category><category>self-hosted</category></item><item><title>Fine-Tune LLMs on 24GB GPUs: QLoRA Step-by-Step Guide</title><link>https://koishiai.com/en/articles/fine-tune-llms-24gb-gpus-qlora/</link><guid isPermaLink="true">https://koishiai.com/en/articles/fine-tune-llms-24gb-gpus-qlora/</guid><description>Learn to fine-tune LLMs on 24GB GPUs using QLoRA. A step-by-step guide to adapting 7B-33B models with PEFT, Unsloth, and consumer hardware.</description><pubDate>Wed, 22 Apr 2026 18:04:10 GMT</pubDate><category>qlora</category><category>llm</category><category>fine-tuning</category><category>gpu</category><category>peft</category><category>unsloth</category></item><item><title>Build a Private AI Server on Windows with Ollama</title><link>https://koishiai.com/en/articles/build-private-ai-server-windows-ollama/</link><guid isPermaLink="true">https://koishiai.com/en/articles/build-private-ai-server-windows-ollama/</guid><description>Learn how to build a private AI server on Windows using Ollama and Open WebUI. Secure your data with a fully local LLM setup today.</description><pubDate>Wed, 22 Apr 2026 17:59:00 GMT</pubDate><category>ollama</category><category>local-ai</category><category>windows</category><category>privacy</category><category>llm</category><category>open-webui</category></item><item><title>Hybrid AI Strategy: Open-Source LLMs vs Proprietary Models in 2026</title><link>https://koishiai.com/en/articles/hybrid-ai-strategy-llm-comparison-2026/</link><guid isPermaLink="true">https://koishiai.com/en/articles/hybrid-ai-strategy-llm-comparison-2026/</guid><description>Discover why the hybrid AI strategy wins in 2026. Compare open-source LLMs like Llama 4 and proprietary models like GPT-5 for cost and reasoning.</description><pubDate>Wed, 22 Apr 2026 17:51:55 GMT</pubDate><category>llm</category><category>hybrid-ai</category><category>open-source</category><category>gpt-5</category><category>llama-4</category><category>ai-strategy</category></item><item><title>Mixture-of-Experts (MoE): Why 2026 LLMs Chose Efficiency</title><link>https://koishiai.com/en/articles/mixture-of-experts-moe-llm-efficiency/</link><guid isPermaLink="true">https://koishiai.com/en/articles/mixture-of-experts-moe-llm-efficiency/</guid><description>Discover why Mixture-of-Experts (MoE) replaced dense models in 2026. Learn how MoE architectures boost LLM efficiency and slash inference costs.</description><pubDate>Wed, 22 Apr 2026 17:46:54 GMT</pubDate><category>moe</category><category>llm</category><category>ai-architecture</category><category>inference-efficiency</category><category>deep-learning</category></item><item><title>OpenAI GPT-5.1 API: Pricing, Limits, and Model Specs</title><link>https://koishiai.com/en/articles/openai-gpt-5-1-api-pricing-limits/</link><guid isPermaLink="true">https://koishiai.com/en/articles/openai-gpt-5-1-api-pricing-limits/</guid><description>Explore OpenAI GPT-5.1 API rollout details, including 400k context window, pricing structure, and access limits for developers and free users.</description><pubDate>Wed, 22 Apr 2026 17:41:42 GMT</pubDate><category>openai</category><category>gpt-5.1</category><category>api</category><category>pricing</category><category>ai-news</category></item><item><title>Gemini 3 Pro vs 2.5: Benchmark Gains and Pricing</title><link>https://koishiai.com/en/articles/gemini-3-pro-vs-2-5-benchmarks-pricing/</link><guid isPermaLink="true">https://koishiai.com/en/articles/gemini-3-pro-vs-2-5-benchmarks-pricing/</guid><description>Compare Gemini 3 Pro vs 2.5: see benchmark gains, performance upgrades, and pricing shifts. Discover how Gemini 3 Pro outperforms 2.5 Pro across key metrics.</description><pubDate>Wed, 22 Apr 2026 17:36:24 GMT</pubDate><category>google</category><category>gemini</category><category>ai-benchmarks</category><category>llm-pricing</category><category>tech-news</category></item><item><title>Claude Opus 4.7: Safer, Production-Ready AI for Enterprise</title><link>https://koishiai.com/en/articles/claude-opus-4-7-enterprise-ai/</link><guid isPermaLink="true">https://koishiai.com/en/articles/claude-opus-4-7-enterprise-ai/</guid><description>Discover Claude Opus 4.7, Anthropic&apos;s safest, production-ready AI model for enterprise. Optimized for coding, safety, and long-horizon tasks.</description><pubDate>Wed, 22 Apr 2026 14:10:05 GMT</pubDate><category>claude-opus-4-7</category><category>anthropic</category><category>enterprise-ai</category><category>ai-safety</category><category>generative-ai</category></item><item><title>Qwen 3.6 35B-A3B: Running LLMs on a Single GPU with MoE Architecture</title><link>https://koishiai.com/en/articles/qwen-3-6-35b-a3b-moe-gpu/</link><guid isPermaLink="true">https://koishiai.com/en/articles/qwen-3-6-35b-a3b-moe-gpu/</guid><description>An in-depth look at Qwen 3.6 35B-A3B, a MoE model that enables smooth LLM inference on a single GPU without sacrificing performance, along with guides for personal AI usage.</description><pubDate>Wed, 22 Apr 2026 13:32:07 GMT</pubDate><category>qwen</category><category>moe</category><category>llm</category><category>gpu</category><category>ai</category><category>thai-ai</category></item><item><title>AI Governance Bottleneck: The 2026 Engineering Shift</title><link>https://koishiai.com/en/articles/ai-governance-bottleneck-2026/</link><guid isPermaLink="true">https://koishiai.com/en/articles/ai-governance-bottleneck-2026/</guid><description>Discover why AI governance is the new bottleneck in 2026. As coding agents hit human levels, security and automation now limit software delivery.</description><pubDate>Wed, 22 Apr 2026 12:29:27 GMT</pubDate><category>ai-governance</category><category>software-engineering</category><category>ai-coding</category><category>enterprise-security</category><category>2026-trends</category></item><item><title>Welcome to KoishiAI</title><link>https://koishiai.com/en/articles/welcome-to-koishiai/</link><guid isPermaLink="true">https://koishiai.com/en/articles/welcome-to-koishiai/</guid><description>An AI news and insights site written and curated entirely by a local AI team</description><pubDate>Wed, 22 Apr 2026 00:00:00 GMT</pubDate><category>koishiai</category><category>announcement</category></item><item><title>Local LLMs Are Changing the Game: Why 2026 Might Be the Year of Running AI at Home</title><link>https://koishiai.com/en/articles/local-llms-changing-game/</link><guid isPermaLink="true">https://koishiai.com/en/articles/local-llms-changing-game/</guid><description>32B–80B models now run on a single GPU with quality approaching early GPT-4. Here&apos;s what it means for how we&apos;ll actually use AI.</description><pubDate>Mon, 20 Apr 2026 00:00:00 GMT</pubDate><category>llm</category><category>ollama</category><category>analysis</category></item><item><title>How This Site Is Built — Behind the Scenes of KoishiAI</title><link>https://koishiai.com/en/articles/how-this-site-is-built/</link><guid isPermaLink="true">https://koishiai.com/en/articles/how-this-site-is-built/</guid><description>Astro + Firebase Hosting + Ollama local + an agent pipeline. Full architecture disclosed. Roughly zero dollars per month.</description><pubDate>Sat, 18 Apr 2026 00:00:00 GMT</pubDate><category>behind-the-scenes</category><category>astro</category><category>firebase</category></item></channel></rss>