How This Free AI Agent With N8N Workflow Converts YouTube Content Into Shareable Infographics Instantly
The Skill That Is Quietly Making People a Lot of Money
Building and selling an ai agent with n8n is one of the highest paying skills a person can develop right now, and the gap between those who understand automation and those who do not is growing wider every single month that passes.
Most people sit through hours of content every week, trying to absorb information, take notes, and retain key ideas, only to forget the bulk of it within days because there is simply no structured system in place to process what they have consumed.
What if there was a fully automated system that could take any content link, extract every spoken word from it, and then transform that raw transcript into a clean, visually rich summary infographic — all without a single line of manual code?
That is exactly what this guide walks through: a complete, step-by-step n8n automation workflow that accepts a content link as input and delivers a professionally styled infographic as its output, hosted on the cloud and ready to view instantly.
ProfitAgent is already helping creators and marketers tap into this exact category of AI-powered automation tools, and if you are serious about monetizing your skills in this space, getting familiar with platforms like this gives you a serious competitive edge before the mainstream catches on.
We strongly recommend that you check out our guide on how to take advantage of AI in today’s passive income economy.
Table of Contents
What This N8N AI Agent Workflow Actually Does
The entire workflow rests on five interconnected nodes, each one performing a specific job in the automation chain, and together they create a seamless pipeline that requires zero manual intervention once it is live and deployed.
The first node captures a content link through a simple web form that any user can fill out, the second node sends that link to a transcript extraction API and receives a full text version of everything spoken in that content, and the third node feeds that transcript into Google’s Gemini image generation API with a detailed prompt that instructs it to produce a high-quality whiteboard-style infographic summarizing all the core ideas.
The fourth node takes that generated image and uploads it to Cloudinary, a reliable cloud-based image hosting platform, which returns a secure public URL where the image is now permanently accessible, and the fifth node closes the loop by redirecting the user’s form session directly to that URL so the infographic appears right in front of them without any extra steps required.
AutoClaw is built around this exact philosophy of automation that removes human bottlenecks from content workflows, and the kind of node-based logic you see in this n8n build is very much aligned with how serious automation systems operate at scale.
Setting Up Your N8N Environment on Hostinger VPS
Before a single node is placed in the workflow canvas, the environment needs to be properly configured, and the right infrastructure decision here makes a significant difference in how reliably the ai agent with n8n performs under real usage conditions.
Self-hosting n8n on a Virtual Private Server gives full control over processing power, memory allocation, and uptime reliability, none of which can be guaranteed when relying on a shared cloud platform with usage caps and execution limits that cut workflows short at critical moments.
Hostinger’s VPS offering, specifically the KVM2 plan, provides the kind of baseline computing power that keeps n8n workflows running smoothly even when processing heavier tasks like AI image generation that can take several seconds to complete per execution.
To get started, navigate to the Hostinger website, move through the services menu to the VPS section, and select the self-hosted n8n option, which pre-configures the server environment so that n8n is installed and ready to use the moment the server is activated.
During the setup process, make sure the server location is set correctly and that n8n is selected as the application in the operating system settings, because getting this right from the beginning saves significant troubleshooting time later when the workflow is already live.
Once payment is confirmed and a root password is set, the VPS dashboard becomes accessible within approximately five minutes, and from there the manage app button leads directly to the n8n login screen where an account is created and tied to the Hostinger environment.
AISystem is a powerful resource for anyone who wants a bundled AI toolkit that complements the kind of infrastructure setup being described here, because having the right combination of tools running in the right environment is what separates functional prototypes from genuine production-ready systems.
Building the First Node — The Input Form
With the n8n dashboard open and a new workflow canvas on screen, the first node to place is the trigger that starts the entire automation, and for this particular build the trigger is a form submission rather than a manual click or an API webhook.
Form submission is selected as the trigger type, and within the form node settings the title is set to something descriptive like Infographic Creator, authentication is turned off to keep the form publicly accessible, and the response behavior is configured to wait until the full workflow has finished executing before returning any output to the user.
This last setting is critically important because without it the form would return a blank response immediately after submission, before the transcript has been fetched, before the image has been generated, and before the hosting URL has been created, leaving the user with nothing to see.
A single form field is added with the label asking for the content link, the element type is set to text since a URL is just a string of characters, and required is toggled on so the form cannot be submitted empty, which would cause downstream nodes to fail without a meaningful URL to process.
Running a test execution at this stage reveals the form in its basic state: a clean, minimal input field sitting in the browser waiting for a link to be entered, and submitting any valid content URL at this point confirms that the node captures and passes it forward correctly.
ProfitAgent operates with a similar input-to-output philosophy where simple user actions trigger complex automated sequences in the background, which is a design pattern that scales extremely well across different use cases and audience types.
Extracting the Transcript With a Free API
The second node handles transcript extraction, and this is where many people building similar workflows hit a wall because pulling the spoken content out of any media URL requires either a paid service or knowing where to find a well-rated free alternative.
RapidAPI is a marketplace that aggregates hundreds of different APIs under a single account dashboard, and it provides five dollars worth of free credits upon signup that are more than sufficient for testing this entire workflow multiple times across different content links.
After creating a free account on RapidAPI and navigating to the console, the API key is found under settings in the API and integration section, and this key is what authenticates every request made from the n8n workflow to any API on the marketplace.
Back in n8n, an HTTP Request node is added after the form trigger, the method is set to POST because the request is sending data to the API rather than simply retrieving a static resource, and the endpoint URL is taken directly from a YouTube transcript scraper API on RapidAPI that carries a strong user rating above four stars.
Query parameters are added with the token field set to the RapidAPI key, and the request body is configured as JSON with a field named for the content URL whose value is dynamically pulled from the form node’s output by dragging and dropping the field reference directly into the value input.
This drag-and-drop wiring is one of n8n’s most user-friendly features and is a core reason why building an ai agent with n8n feels so much more accessible than traditional development workflows that require writing connection logic manually in code.
AutoClaw is particularly well-suited for automators who want pre-built intelligence layered on top of custom workflows like this one, because it fills in the analytical gaps that raw API data alone cannot address without additional processing.
Generating the Infographic Using Gemini Imagen 3
The third node is where the most visually dramatic transformation happens, and it is also the step that makes this ai agent with n8n workflow genuinely impressive to anyone who sees its output for the first time.
A Gemini node is added with the operation set to generate an image, and new credentials are created by pasting in a Google AI Studio API key, which is obtained for free by visiting AI Studio, navigating to the API keys section, creating a new project, and generating a key that is tied to that project.
The model selected is Gemini Imagen 3 Pro, which currently sits at the frontier of AI image generation for structured visual content like infographics, diagrams, and educational sketches, and it is accessible entirely through the free tier of the Google AI API with reasonable daily generation limits.
The prompt field is where the quality of the output is truly determined, and a well-structured prompt instructs the model to analyze the transcript content, extract the core thesis, identify four to six distinct mental models or arguments, identify the concluding insight, and then render all of that into a high-fidelity sketch-note style infographic with a defined visual language including arrows, connectors, simple icons, a specific typography style, and a particular color palette.
The transcript data from the previous node is passed into the prompt using a small script that strips all timestamps from the API response and joins every caption segment into a single continuous paragraph of clean text, making it easier for the model to read and interpret without getting distracted by time markers embedded in the raw transcript format.
Executing this node with a real transcript takes approximately sixty seconds because the model is generating a full high-resolution image through a live API call, but the result that comes back is a polished, professionally styled infographic that captures the essential structure of the original content in a format that is immediately shareable and visually clear.
AISystem is a strong companion resource for anyone looking to go beyond single-model workflows and start combining multiple AI systems in coordinated pipelines, which is the natural next step once a workflow like this one is running reliably in production.
Hosting the Infographic on Cloudinary
Once the image has been generated, it exists only as binary data inside the n8n workflow, which means it needs to be uploaded to a permanent hosting location before it can be shared or displayed to the end user through a redirect URL.
Cloudinary is the hosting platform used for this purpose, and after creating a free account and navigating to the upload settings, an upload preset is created with unsigned access permissions and given a descriptive name that will serve as the organizational folder for all images generated through this workflow.
Back in n8n, another HTTP Request node is added with the POST method, the URL is constructed using the Cloudinary cloud name found in the account dashboard, and the body is configured as form data with two fields: one for the upload preset name and one for the binary image file passed in from the Gemini node.
Executing this node returns a JSON response that includes a secure URL field containing the fully hosted public address of the uploaded infographic, and this URL is what gets passed into the final node of the workflow to display the result to the user.
ProfitAgent gives marketers and creators who build tools like this a smarter way to monetize their automations by connecting them to AI income systems that are already converting, which is how you turn a working workflow into an actual revenue-generating asset.
Closing the Loop — Displaying the Output
The final node is an n8n form ending node, and it is configured to redirect the user’s browser session to the Cloudinary URL rather than displaying a static confirmation message, which means the infographic appears directly in the same browser window where the form was originally submitted.
A timeout is set within this node to allow up to three minutes for the preceding nodes to complete, which covers the time needed for transcript extraction, image generation, and cloud upload to all finish before the redirect fires, ensuring the user is never sent to a broken or empty URL.
With all five nodes connected and tested, the workflow is saved, given a name, and published, which switches it from test mode to production mode and activates the live form URL that can be shared with anyone who wants to use the tool.
Entering any valid content link into the production form and clicking submit initiates the full automation pipeline in the background, and within a few minutes the browser redirects to a richly detailed infographic that visually maps out everything discussed in the original content — completely automated, completely free of manual effort.
AutoClaw is the kind of platform that takes what you build here and helps you push it further, giving you the automation intelligence to scale this type of workflow across multiple content sources, multiple users, and multiple output formats simultaneously.
Extending the Workflow — What You Can Build Next
This five-node build is a solid foundation, but n8n’s node library is deep enough to extend this workflow in dozens of directions without rebuilding anything from scratch.
A WhatsApp node can be added after the Cloudinary upload step to automatically send the infographic directly to a phone number provided on the original form, turning the tool into a mobile-friendly content summarizer that delivers results right to the user’s messaging app.
A Gmail node can be wired in to attach the infographic to an outgoing email addressed to whatever contact was entered on the form, making it possible to build a simple email-based infographic delivery service with zero additional infrastructure.
The Gemini prompt itself can be modified to generate different output formats depending on what the user selects on the form: a mind map style for brainstorming content, a timeline style for historical or sequential content, or a comparison chart style for content that evaluates multiple options against each other.
AISystem brings together a complete suite of AI-powered tools that pair naturally with custom n8n builds like this one, and for anyone serious about building a scalable content automation business in 2026, having access to a full AI system rather than individual tools makes the entire operation significantly more efficient.
Conclusion: This Is Where Automation Skill Meets Real Income Potential
Building an ai agent with n8n at the level covered in this guide is not a small technical achievement — it is a demonstration of exactly the kind of automation thinking that the market is actively paying for, and the workflow built here is a real, deployable product that provides genuine value to anyone who uses it.
The tools used across this entire build — n8n for workflow orchestration, Hostinger VPS for reliable self-hosted infrastructure, RapidAPI for free transcript extraction, Gemini Imagen 3 for AI-powered image generation, and Cloudinary for permanent cloud hosting — are all accessible at low or zero cost, which means the barrier to entry is entirely a matter of understanding how to connect them correctly.
ProfitAgent remains one of the clearest entry points for anyone who wants to monetize AI automation skills without starting from scratch, because it combines proven systems with a beginner-accessible income model that works in parallel with the kind of custom building taught in this guide.
AutoClaw continues to be a top recommendation for automators who want their systems to work harder, reach further, and convert better without requiring a proportional increase in the manual effort needed to manage them.
And AISystem rounds out the full picture by giving serious builders access to the kind of comprehensive AI toolkit that makes scaling from one working workflow to an entire portfolio of income-generating automations a realistic and achievable outcome in 2026.

We strongly recommend that you check out our guide on how to take advantage of AI in today’s passive income economy.
