❌

Vue lecture

Il y a de nouveaux articles disponibles, cliquez pour rafraîchir la page.

Remote HTTP access to self-hosted Ollama AI models

Michael Pietroforte

21 août 2025 à 17:06

In my previous post in the Ollama series with OpenAI OSS models, I explained how to install Ollama on an Ubuntu EC2 instance. If you followed my guide, you now have everything set up to remotely access gpt-oss-20b on your self-hosted Ollama installation via HTTP. I will also showcase how to connect to remote AI models in VS Code using popular coding assistants like Cline and Roo Code, as well as through AI apps such as the local Ollama client, BoltAI, and Anything LLM.

Install Ollama with OpenAI gpt-oss-20b on Ubuntu within an EC2 instance

Michael Pietroforte

19 août 2025 à 18:33

In my previous post, I explained why you might want to run OpenAI gpt-oss-20b or gpt-oss-120b on your own virtual machine and how to select an appropriate EC2 instance type. Today, I will show you how to install Ollama via script or snap on Ubuntu within an EC2 instance. Most steps can be adapted for other cloud providers, on-premises setups, or Linux distributions with minor modifications. I will also briefly discuss the benefits of running open-weight models on an EC2 instance with Ollama compared to Bedrock, SageMaker, vLLM, or Hugging Face Transformers.

Running OpenAI gpt-oss-20b and gpt‑oss-120b in AWS on your own EC2 instance

Michael Pietroforte

18 août 2025 à 18:17

OpenAI recently released its first open-weight models, gpt-oss-20b and gpt‑oss-120b, which you can download and run with Ollama. If security and privacy are essential when using AI models, you might not want to use the vendor's chat UI or API. The most powerful AI models require significant computing power and memory, and many desktop devices cannot run these models. Although Ollama was not designed with remote connections in mind, it is still possible. In this series, I will explain how to install OpenAI's open-weight models using Ollama and connect to them remotely from various applications, including VS Code GitHub Copilot. In this first post, I will outline how to choose the right size for your virtual machine, using EC2 instance types in AWS as an example.

Microsoft AI Shell Preview 6: MCP support and great PowerShell integration

Michael Pietroforte

14 août 2025 à 18:24

The key new feature in the just-released AI Shell Preview 6 is run_command_in_terminal, among several new built-in tools that allow the AI model to run commands and review outputs directly in your PowerShell terminal. Additionally, AI Shell now supports local and remote MCP servers, transforming it into a versatile AI assistant across Windows, Mac, and Linux.

ChatGPT Gmail Connector: OpenAI’s next flop?

Michael Pietroforte

13 août 2025 à 17:29

The ChatGPT Gmail connector was first released in June 2025 for Plus, Pro, Team, Enterprise, and Edu users. However, because of alleged data protection rules, users in the European Economic Area, Switzerland, and the UK were excluded. OpenAI is now deploying connectors for Gmail, Google Calendar, Google Drive, and GitHub across Europe for all plans—notably after Anthropic released their MCP connectors weeks ago in Europe without such concerns, suggesting OpenAI's initial data protection claims were merely pretextual or that MCP has no priority status for the AI company. The announcement mentions Google Contacts, but this connector isn't yet accessible in my ChatGPT with a Plus plan. I tested the Gmail Connector, and following GPT-5's disastrous launch, it appears OpenAI released another flop immediately afterward.

GPT-5: Another crash into the AI Wall

Michael Pietroforte

11 août 2025 à 20:44

The web is flooded with complaints from GPT-5 users, prompting OpenAI's CEO to reintroduce GPT-4o as an alternative. This marks the third setback following the departure of chief scientist Ila Sukestver, who led GPT development until version 4. One likely cause of user frustration is that, following Altman's recent boasts, everyone anticipated a major advance towards "AGI," but we are merely observing another crash into the AI Wall.

The new browser war – Does AI and ‘fair use’ kill the open web?

Michael Pietroforte

8 août 2025 à 19:47

In my previous post, I covered Microsoft’s new Edge Copilot Mode, introducing only one significant change. However, shifting Copilot from the sidebar to the center in Edge is more than just a design update. Microsoft’s move marks the first strategic step in the unfolding new browser war involving Google, OpenAI, and Perplexity—companies that exploit broad 'fair use' claims to harvest web content for their own greedy gain. But the upcoming battles aren’t just about who controls the web’s entry point—it could signal the beginning of the end of the web as we know it. The fall of Google’s dominance as the web’s central policymaker now seems inevitable. Once the dust settles from the new browser war, a decentralized web shaped by independently communicating AI agents might emerge. Think that’s far-fetched? Keep reading.

Download OpenAI open-weight models gpt-oss:20/gpt-oss:120 in Ollama and use in VS Code GitHub Copilot

Michael Pietroforte

6 août 2025 à 20:15

OpenAI has just released its first open-weight models, gpt-oss:20 and gpt-oss:120, enabling you to run these surprisingly powerful models locally, which benefits security and privacy. I downloaded gpt-oss:20 and tested it on my laptop using Ollama and within VS Code with GitHub Copilot. To my surprise, the 20 billion parameter model gpt-oss:20 responded to my prompt.

How to enable Edge Copilot Mode

Michael Pietroforte

5 août 2025 à 15:39

In Edge Copilot Mode, announced a few days ago, Copilot shifts from being just an AI assistant to becoming the main browser interface. Now front and center in Microsoft's browser, Copilot has evolved into a serious ChatGPT competitor. In this post, I explain how to enable Edge Copilot Mode and discuss whether it introduces any new features compared to the previous version of Copilot in Edge.

Windows-MCP: Automating the Windows GUI with AI

Michael Pietroforte

1 août 2025 à 21:18

Windows-MCP is an open-source MCP server that enables AI agents to control the Windows operating system, supporting tasks such as file navigation, application launching, and GUI automation. It features mouse/keyboard input tools, clipboard management, and window and app state detection. I tested Windows-MCP, one of the extensions listed in Anthropic Claude Desktop that can be installed using the free plan.

Can ChatGPT Agent Mode solve Google CAPTCHA (reCAPTCHA) and validate the Cloudflare Turnstile challenge?

Michael Pietroforte

30 juillet 2025 à 16:58

Reports indicate that ChatGPT Agent Mode might be capable of solving CAPTCHAs, which are often the final line of defense against persistent bots like AI crawlers on many websites. However, these claims originate from Reddit, known for the frequent hallucinations of its users. To verify, I decided to test it myself.

Testing ChatGPT Agent Mode: A flawed concept?

Michael Pietroforte

29 juillet 2025 à 18:23

The new ChatGPT Agent Mode combines the Operator's ability to browse and interact with websites with Deep Research's ability to analyze and synthesize information from multiple sources. I tested ChatGPT's Agent Mode and was quite disappointed. Honestly, I find it surprising that OpenAI made this feature available to the public, considering that they have led the way with their chatbot so far. In my view, ChatGPT's Agent Mode is a flawed concept because it forces an AI agent to use a web browser, a relatively primitive tool designed for inferior human intelligence.

Is AI overhyped?

Michael Pietroforte

25 juillet 2025 à 17:08

Are you skeptical about AI? Do you believe that AI is overhyped? Then, the latest episode of the Big Technology podcast is for you. AI champion Alex Kantrowitz and fierce AI critic Ed Zitron tackle the gap between AI hype and reality. However, both prominent journalists overlook an essential aspect when assessing AI's economic potential.

AI-powered email search with Claude Gmail Search, Zapier MCP server, GongRzhe’s MCP Server, and Gemini in Gmail

Michael Pietroforte

23 juillet 2025 à 19:44

I tested three MCP servers in Anthropic Claude Sonnet 4: Claude Gmail Search, Zapier Gmail MCP server, and GongRzhe’s Gmail MCP Server. I also submitted the same search prompt to Gemini in Gmail. The results varied greatly. Only two AI tools passed my test, one failed, and the most promising MCP server turned out to be useless.

Anthropic Claude desktop extensions, connectors vs. local, remote MCP servers

Michael Pietroforte

21 juillet 2025 à 15:11

Anthropic has introduced two new concepts for connecting AI models to resources through MCP: desktop extensions and connectors. Extensions are available only on Claude Desktop (Windows and Mac), while connectors are supported on both the desktop and the Claude.ai web app. All desktop extensions are local MCP servers, but not all local MCP servers qualify as extensions. Conversely, all remote MCP servers are considered connectors; however, not all connectors are remote MCP servers. Although some theoretical documentation about these new concepts exists, I mainly categorized them based on how Claude Desktop organizes the different types of MCP servers.

New in VS Code GitHub Copilot: Command allow/deny lists, resubmit requests, MCP server catalog

Michael Pietroforte

16 juillet 2025 à 20:35

VS Code version 1.102 introduced several useful new features in GitHub Copilot. Command allow/deny lists let you specify which commands the Copilot agent can execute without asking for permission. The resubmit feature enables you to edit a previous request and resubmit the prompt. Additionally, the new VS Code version includes various updates for managing MCP servers, including a curated catalog of MCP servers.

METR study: Is AI-assisted coding overhyped?

Michael Pietroforte

14 juillet 2025 à 17:17

A new study by METR got a lot of attention online. It claims that developers often overestimate the productivity boost from AI tools. In a coding test, the researchers found that AI caused a 19% "slowdown" for developers. In my opinion, it's not AI coding that's overhyped, but rather the study's findings.

GitHub Copilot concepts: An overview of all services and tools

Michael Pietroforte

11 juillet 2025 à 15:25

The number of GitHub Copilot concepts continues to grow, and their similar names can easily confuse. This post offers an overview of all the concepts, along with a link to find more information. The main confusion stems from the Copilot tools on GitHub.com and GitHub Copilot in VS Code. Although there is some overlap, keeping these two areas separate is essential.

Google Gemini CLI: Free AI-powered terminal assistant for the IT admin

Michael Pietroforte

1 juillet 2025 à 18:11

Google's new Gemini CLI allows up to 60 model requests per minute and 1,000 daily requests. Professional developers might hit these limits quickly, but many IT admins will find the free plan sufficient. Compared to Anthropic Claude Code and OpenAI Codex CLI, I find Google's open-source AI-powered terminal more useful for the AI-augmented admin.