Custom License Creator: an AI-powered tool for generating open source licenses
A Gradio web app that uses AI to recommend existing open source licenses or generate custom ones based on your requirements described in plain English.
138 posts · page 3 of 6
A Gradio web app that uses AI to recommend existing open source licenses or generate custom ones based on your requirements described in plain English.
An AI-powered React app that analyzes how different countries approach policy challenges, with interactive clustering visualizations powered by Gemini.
An automated pipeline that converts raw voice recordings into polished blog posts using audio preprocessing, Gemini transcription, and AI-powered formatting.
A voice analysis application built with Google AI Studio and the Gemini API, exploring multimodal AI capabilities for audio processing.
A KDE Plasma 6 widget displaying both Gregorian and Hebrew calendar dates with sunset-aware transitions and multiple format options.
A Python tool with a local web UI for extracting, reconstructing, and preserving WhatsApp chat exports with voice transcription and anonymization.
A PyQt5 desktop application for reading and writing NFC tags using the ACS ACR1252U reader on Linux, with batch writing and system tray integration.
A curated list of command-line AI coding tools maintained by the model vendors themselves, from Claude Code to Gemini CLI.
A small curated collection of MCP servers for accessing Jewish texts, libraries, and calendar data through AI agents.
Notes and resources on the pain points of MCP adoption and why consolidation through gateways and connectors matters.
Planning notes and sketches for an MCP gateway architecture that aggregates servers into LAN and WAN gateways.
A Claude Code plugin that systematically prepares human-built codebases for agentic AI development workflows.
A tool for maintaining personalized writing rules and vocabulary preferences, consolidated into a Claude Code slash command.
An experimental AI agent that discovers, categorizes, and visualizes technology ecosystems from a single keyword search.
A curated directory of Model Context Protocol servers providing access to Israeli data sources, government APIs, and services.
A curated index of 100+ voice technology tools accessible to Linux desktop users, from real-time dictation to dev frameworks.
A curated resource list of multimodal AI models with native audio support — models that process audio tokens, not just transcribe.
Comparing 8 STT models on a 27-minute podcast. Local Whisper wins on word accuracy, but cloud APIs dominate punctuation.
A short curated list of the best Whisper fine-tuning resources: tutorials, notebooks, and managed compute examples.
Evaluating whether fine-tuning Whisper improves transcription accuracy. Spoiler: it depends on model size and use case.
A script for fine-tuning OpenAI's Whisper speech recognition models using Modal's serverless GPU infrastructure.
A voice-controlled Linux virtual keyboard using Deepgram's Flux turn-taking STT API, built in Rust.
A GUI tool for collecting audio training data for ASR fine-tuning, with LLM-generated prompts and Hugging Face integration.
An MCP server for audio transcription using multimodal LLMs like Gemini, GPT-4o Audio, and Voxtral — not traditional ASR.
An MCP server that brings Gemini-powered audio transcription directly into Claude Code and Claude Desktop.