AI tekst-til-tale Jobs
I’m looking for someone Pakistani know the Punjabi language who already knows the ins and outs of ElevenLabs and can set up a high-quality text-to-speech clone of my own voice for personal use. I’ll supply clean audio samples of myself; you’ll handle every technical step inside the ElevenLabs interface—uploading, training, fine-tuning, and validating until the synthesized speech sounds natural and consistent with my real tone and inflection. Once the model is ready, I’d like a quick walkthrough of best practices: which settings give the most realistic results, how to adjust style or stability, and how to generate audio both in the web dashboard and through the API so I can use the voice in my private projects. Deliverables • A fully trained, ready-to-...
I am putting together a low-cost, 10.5 GHz phased-array radar that must spot and track any flying object out to roughly 25–30 km. The core architecture is Pulse-LFM and the whole unit has to stay hand-held and field-serviceable, so every gram and watt matters. I am looking for someone whose background is firmly rooted in radar system development; experience with phased-array hardware, RF front-ends, and real-time signal processing is essential. Off-the-shelf evaluation boards or turnkey kits will not meet our needs—we have to build the hardware from the ground up. Re-using or adapting open-source material on the software side is welcome, and I am open to AI-assisted techniques for algorithm design or UI generation, as long as the end product remains fully traceable and maintai...
Smart Narrator AI - Context-Aware Text-to-Speech Transform boring text into emotionally intelligent, expressive speech Project Overview Smart Narrator AI is an advanced text-to-speech system that understands emotional context and adapts voice characteristics accordingly. Instead of robotic, flat narration, this system analyzes text intent and speaks it with appropriate tone, pace, and emotion. The Problem with Regular TTS Standard TTS Output: "WARNING! System failure!" (monotone, same as everything else) Smart Narrator AI Output: "WARNING! System failure!" (fast, urgent, high pitch - sounds like actual emergency) The Solution: Adaptive Prosody Generation This project implements context-aware prosody generation - the AI decides HOW to speak based on WHAT the text mean...
POSITION BRIEF: Discover Henderson – AI Platform & Systems Manager Discover Henderson is building a next generation tourism platform powered by AI. We are seeking a technical operator who can oversee our entire digital ecosystem — including our website, automations, data systems, and our AI concierge, Ava. This role ensures the platform runs smoothly, evolves continuously, and delivers an exceptional experience for visitors and local partners. ⭐ Role Title AI Platform Manager / No Code Systems Integrator (Wix + Voiceflow + + Airtable) ⭐ Role Summary You will manage and optimize the full Discover Henderson platform, including the website, partner onboarding systems, automations, data flows, and Ava — our AI concierge. Your job is to maintain stability, improve funct...
Goal: Create a FULLY AUTOMATED process that takes a male audio file and converts it into a female voice. What you must do: 1) Take the male audio I provide 2) Convert it into a female voice 3) Upload the final audio into a Google Drive folder 4) Add the Google Drive link in your competition entry 5) Explain clearly what software/tools you will use 6) Explain clearly how you will automate the FULL process from start to finish Important: - The automation must run locally - The final voice must sound perfectly natural and human - The female voice must correctly reproduce the multiple emotions, tone and intonations from the original audio - The result must NOT sound robotic or AI-generated - The automation must be able to process multiple audio files - Do NOT clean the original audio more...
Goal: Create a FULLY AUTOMATED process that takes a male audio file and converts it into a female voice. What you must do: 1) Take the male audio I provide 2) Convert it into a female voice 3) Upload the final audio into a Google Drive folder 4) Add the Google Drive link in your competition entry 5) Explain clearly what software/tools you will use 6) Explain clearly how you will automate the FULL process from start to finish Important: - The automation must run locally - The final voice must sound perfectly natural and human - The female voice must correctly reproduce the multiple emotions, tone and intonations from the original audio - The result must NOT sound robotic or AI-generated - The automation must be able to process multiple audio files - Do NOT clean the original audio more...
I need a ten-minute YouTube video built entirely with AI-driven 3D animated characters. The piece must carry a professional, serious tone—think corporate explainer rather than cartoon—while still feeling visually engaging. Precise, frame-accurate lip sync is critical. Whether you connect a pre-recorded voice-over I supply or generate a natural-sounding AI voice yourself, the mouth movements have to match flawlessly throughout the full ten minutes. Please use whichever tools you trust—Unreal Engine’s MetaHuman Animator, Blender with FaceWare, or other reliable AI lip-sync solutions—as long as the final result looks polished and on beat. I will provide the script, branding assets, and any reference footage once we begin. Your deliverables are: • A 1920&t...
Siamo uno studio commercialistico alla ricerca di un consulente freelance specializzato in intelligenza artificiale, con esperienza nell’analisi dei processi aziendali e nella progettazione di soluzioni personalizzate. L’obiettivo è individuare come integrare l’AI nella nostra organizzazione per migliorare efficienza, automazione e qualità del lavoro, nel rispetto di riservatezza, sicurezza dei dati e normative applicabili. Attività richieste: Analisi del contesto organizzativo e dei processi interni dello studio. Individuazione delle aree in cui l’AI può apportare valore concreto. Proposta di soluzioni personalizzate e realistiche per le nostre esigenze. Eventuale automazione di attività ripetitive o documentali. Definizion...
We are looking for a HIGH-LEVEL AI conversation engineer to help polish and optimize a live AI phone ordering system already running on Twilio + n8n + ElevenLabs + OpenAI. IMPORTANT: The backend infrastructure and payment loop are already mostly working. We are NOT looking for someone to rebuild the platform. We specifically need someone strong in: - AI prompting - conversational flow optimization - reducing hesitation/repetition - human-like ordering behavior - interruption handling - “pay now” behavior - upsell timing logic - manager escalation behavior - fallback/recovery logic - bilingual conversation flow (English/Spanish later) - voice AI optimization in ElevenLabs Current flow: Call → AI order → Stripe payment link → payment confirmation → receip...
I’m launching a Shopify-based T-shirt line and need the entire store built around an AI-driven buying experience. Shoppers must be able to: • Pick their nationality so the on-screen model automatically adjusts skin tone, facial features, and accent. • Choose size, color, fabric, and—because we specialize in Tees—our core Casual style (I may add athletic or formal later). The same AI engine will also generate short spoken videos (15–60 sec) for TikTok, Reels and YouTube Shorts. Each clip should rotate through three themes—product descriptions, promotional hooks and authentic-sounding customer reviews—ready for me to post straight from Shopify’s dashboard. Scope of work 1. Configure and brand a new Shopify store, including payment, shipping...
I’m building a real-time speech-to-text application for Tamil and need a full mobile solution that runs smoothly on both Android and iOS. The core requirement is low-latency live transcription that recognises the major dialects of Tamil—Madurai, Kongu, Nellai, Chennai and Sri Lankan variations—so users hear their words appear on-screen almost instantly, regardless of accent. My priority is accuracy and speed, followed by an interface that keeps the mic open, shows streaming text, and lets users copy, save or share the transcript once they stop speaking. If you can add useful extras such as offline mode, punctuation handling, or a light / dark theme switcher, feel free to mention them. When you respond, focus on your relevant experience: the speech-to-text engines you&rs...
Más detalles: ¿Qué funcionalidades específicas necesita la aplicación? Deslizar para emparejar, Mensajería automatizada, Cargar fotos y videos ¿Cómo deberían subir contenido las creadoras? Manual ¿Qué tipo de integración de IA prefieres para la mensajería? Chatbot con capacidad de autoaprendizaje El objetivo es q las creadoras no tengan q escribir todo el dia con 10 o 20 personas para tener ingresos q la ia tiene la conversación genere atracción y envíe fotos y videos de acuerdo a la conversación y pagos
Possiedo già un canale YouTube dedicato a video ai attualmente tratto argomenti di spiritualità, ma vorrei pian piano spostarmi su video avatar ai che trattano argomenti di salute e benessere, con un focus specifico su Alimentazione e dieta. Cerco una sola persona che segua l’intero flusso creativo fin dall’inizio: • ricerca e stesura degli script • v.o. • montaggio completo con grafiche, musica royalty-free e sottotitoli • ottimizzazione SEO (titoli, descrizioni, tag, miniature) • pubblicazione programmata e analisi delle performance Mi aspetto puntualità nelle consegne, capacità di lavorare in autonomia e voglia di crescere. Quando il canale inizierà a generare entrate rilevanti, il tuo ruolo evolver&agrav...
I am looking for a freelance developer or team to create a local AI avatar system with real-time voice interaction and facial/lip synchronization for Latin American Spanish. Currently, we already have a basic avatar that can display responses, but it does not speak or animate facial movements naturally. The goal is to build an avatar that can: Speak directly using AI-generated voice (TTS) Synchronize mouth/facial movements with speech Simulate realistic modulation using at least the 5 main vowel mouth shapes (visemes/phonemes) Run locally (offline or local server environment) Allow flexible integration with different AI providers Work primarily in Latin American Spanish Main requirements: • Local execution The system must run locally using CPU/GPU resources. Cloud dependence shou...
I want to bring everyday Hindi-English-Marathi conversations into one streamlined Android app. The idea is simple: I point the camera at a street sign or menu, the app grabs the text with Optical Character Recognition, instantly translates it, and then reads the result back to me through a clear Text-to-Speech engine. The same flow should work when I speak or type a phrase—I receive a fast, accurate translation plus an optional audio playback so I can mimic the correct pronunciation on the spot. Even for communication with auto rickshaw driver, sabzi mandi, hawkers etc. I want to make it as a communication tool Core flow • Capture text with OCR from photos or the live camera view • Translate bi-directionally between Hindi, English and Marathi • Convert transl...
Anbefalte artikler for deg
How user testing can make your product great
Get your product into the hands of test users and you'll walk away with valuable insights that could make the difference between success and failure.