Posts · Page 3

138 posts · page 3 of 6

Mar 25, 2026

Custom License Creator: an AI-powered tool for generating open source licenses

A Gradio web app that uses AI to recommend existing open source licenses or generate custom ones based on your requirements described in plain English.

Projects AI Open Source Licensing

Mar 25, 2026

Policy Visualiser: exploring global policy approaches with AI-powered clustering

An AI-powered React app that analyzes how different countries approach policy challenges, with interactive clustering visualizations powered by Gemini.

Projects AI Open Source Gemini

Mar 25, 2026

Voice Blog Creator: turning voice recordings into polished blog posts with Gemini

An automated pipeline that converts raw voice recordings into polished blog posts using audio preprocessing, Gemini transcription, and AI-powered formatting.

Projects AI Open Source Gemini

Mar 25, 2026

Voice Analyzer: an AI-powered voice analysis tool built with Gemini

A voice analysis application built with Google AI Studio and the Gemini API, exploring multimodal AI capabilities for audio processing.

Projects AI Open Source Gemini

Mar 25, 2026

Hebrew Date KDE Widget: dual calendar display for Plasma 6

A KDE Plasma 6 widget displaying both Gregorian and Hebrew calendar dates with sunset-aware transitions and multiple format options.

Projects Open Source Linux KDE

Mar 25, 2026

WhatsApp Export Unpacker: forensic-grade chat preservation with a web UI

A Python tool with a local web UI for extracting, reconstructing, and preserving WhatsApp chat exports with voice transcription and anonymization.

Projects Open Source Python WhatsApp

Mar 25, 2026

NFC Reader/Writer: a desktop GUI for reading and writing NFC tags on Linux

A PyQt5 desktop application for reading and writing NFC tags using the ACS ACR1252U reader on Linux, with batch writing and system tray integration.

Projects Open Source Linux NFC

Mar 25, 2026

Vendor agent CLIs: tracking first-party AI coding tools from model providers

A curated list of command-line AI coding tools maintained by the model vendors themselves, from Claude Code to Gemini CLI.

Projects AI Open Source Resource Lists

Mar 25, 2026

Jewish-interest MCP projects: connecting AI to Jewish texts and calendars

A small curated collection of MCP servers for accessing Jewish texts, libraries, and calendar data through AI agents.

Projects AI Open Source MCP

Mar 25, 2026

On MCP and consolidation: why the current setup is painful and what might fix it

Notes and resources on the pain points of MCP adoption and why consolidation through gateways and connectors matters.

Opinions AI Open Source MCP

Mar 25, 2026

MCP gateway model: notes on a better architecture for MCP management

Planning notes and sketches for an MCP gateway architecture that aggregates servers into LAN and WAN gateways.

Opinions AI MCP Architecture

Mar 25, 2026

Make Agent Friendly: preparing codebases for agentic development

A Claude Code plugin that systematically prepares human-built codebases for agentic AI development workflows.

Projects AI Open Source Claude Code

Mar 25, 2026

Declaude: custom text rewriting rules for AI-generated content

A tool for maintaining personalized writing rules and vocabulary preferences, consolidated into a Claude Code slash command.

Projects AI Open Source Tech How-Tos

Mar 25, 2026

Ecosystem Mapper: an AI agent that visualizes technology landscapes

An experimental AI agent that discovers, categorizes, and visualizes technology ecosystems from a single keyword search.

Projects AI Open Source Automation

Mar 25, 2026

Israel-related MCP servers: a curated list

A curated directory of Model Context Protocol servers providing access to Israeli data sources, government APIs, and services.

Projects AI Open Source MCP

Mar 25, 2026

An index of Linux-friendly voice technology tools

A curated index of 100+ voice technology tools accessible to Linux desktop users, from real-time dictation to dev frameworks.

Projects Open Source AI Speech Recognition

Mar 25, 2026

Audio multimodal AI: the models that understand sound, not just transcribe it

A curated resource list of multimodal AI models with native audio support — models that process audio tokens, not just transcribe.

Projects AI Open Source Speech Recognition

Mar 25, 2026

Benchmarking speech-to-text on long-form audio

Comparing 8 STT models on a 27-minute podcast. Local Whisper wins on word accuracy, but cloud APIs dominate punctuation.

Projects AI Open Source Whisper

Mar 25, 2026

Whisper fine-tuning resources I keep coming back to

A short curated list of the best Whisper fine-tuning resources: tutorials, notebooks, and managed compute examples.

AI Open Source Whisper Speech Recognition

Mar 25, 2026

Does fine-tuning Whisper actually improve accuracy?

Evaluating whether fine-tuning Whisper improves transcription accuracy. Spoiler: it depends on model size and use case.

Projects AI Open Source Whisper

Mar 25, 2026

Fine-tuning Whisper on Modal's serverless GPUs

A script for fine-tuning OpenAI's Whisper speech recognition models using Modal's serverless GPU infrastructure.

Projects AI Open Source Whisper

Mar 25, 2026

Deepgram voice keyboard: a Linux virtual keyboard powered by Deepgram Flux

A voice-controlled Linux virtual keyboard using Deepgram's Flux turn-taking STT API, built in Rust.

Projects AI Open Source Speech Recognition

Mar 25, 2026

ASR Training Data Collector: a GUI for gathering speech recognition training data

A GUI tool for collecting audio training data for ASR fine-tuning, with LLM-generated prompts and Hugging Face integration.

Projects AI Open Source Speech Recognition

Mar 25, 2026

Cloud ASR MCP: multi-backend transcription via multimodal LLMs

An MCP server for audio transcription using multimodal LLMs like Gemini, GPT-4o Audio, and Voxtral — not traditional ASR.

Projects AI Open Source MCP

Mar 25, 2026

Gemini Transcription MCP: audio transcription as an MCP tool

An MCP server that brings Gemini-powered audio transcription directly into Claude Code and Claude Desktop.

Projects AI Open Source MCP