markdown-converter
Install this skill
npx skills add intellectronica/agent-skillsWorks across Claude Code, Cursor, Codex, Copilot & Antigravity
The markdown-converter skill transforms a wide array of file formats into structured Markdown syntax using the markitdown utility. It acts as a bridge between proprietary document formats and text-based formats suitable for large language model context windows or static site generation. By executing directly via uvx, it avoids local dependency management issues while maintaining support for complex objects like tables, lists, and headings. This tool extracts content from office suites, scanned imagery via OCR, and streaming media like YouTube. It also integrates with Azure Document Intelligence for high-fidelity extraction of complex PDF layouts. Whether processing local archives or piping data from command-line streams, the tool standardizes disparate data sources into a uniform, readable format, simplifying downstream data processing and documentation tasks without manual copying or reformatting.
When to Use This Skill
- •Ingesting legacy Excel or Word documents into a documentation repository
- •Converting scanned research papers into searchable Markdown text files
- •Extracting metadata and transcriptions from video or audio assets for indexing
- •Normalizing various data formats like JSON or XML for easier textual comparison
- •Preparing large sets of heterogeneous files for RAG system ingestion
How to Invoke This Skill
Example prompts that trigger this skill in Claude Code, Cursor, or Antigravity:
- “Convert this PDF into markdown for me
- “Transform the report.docx file to a text-based format
- “Extract the data from this spreadsheet into a markdown table
- “Create a markdown summary of this document file
- “Run a conversion on this slide deck to get the text out
Pro Tips
- 💡For conversions from `stdin` where the file type isn't obvious (e.g., piped content), always use `-x` or `-m` options to hint the extension or MIME type for more accurate parsing.
- 💡Combine this skill with a text summarization agent to automatically distill converted documents, providing concise outputs from complex source material.
- 💡When dealing with scanned documents or challenging PDFs, leverage the `-d` flag to utilize Azure Document Intelligence for superior OCR and layout understanding, significantly improving Markdown quality.
What this skill does
- •Transforms office suites including Word, Excel, and PowerPoint into Markdown
- •Performs OCR on image files to extract embedded text
- •Extracts textual content from YouTube URLs and audio files
- •Supports piping inputs from stdin for automated processing pipelines
- •Integrates with Azure Document Intelligence for sophisticated PDF table and structure recovery
When not to use it
- ✕When you need to preserve original visual styling, fonts, or exact layout positioning
- ✕When dealing with encrypted or password-protected document files
Example workflow
- Locate the source document file in your working directory
- Execute the conversion command pointing to the file path
- Specify an output destination to save the generated markdown file
- Review the resulting file to ensure table and list formatting is accurate
- Refine the output by adding manual corrections if complex visuals were present
Prerequisites
- –uv installed on the system
- –Azure Document Intelligence endpoint (optional for advanced PDFs)
Pitfalls & limitations
- !PDFs with complex multi-column layouts may produce disorganized text without the Azure plugin
- !Large media files can result in long processing times during the transcription phase
- !Non-textual elements in spreadsheets may be lost during the conversion process
FAQ
How it compares
Unlike manual copy-pasting which often breaks table alignment and list structures, this utility enforces programmatic parsing that maintains the logical hierarchy and semantic integrity of the source document.
📄 Full skill instructions — original source: intellectronica/agent-skills
Convert files to Markdown using
uvx markitdown — no installation required.## Basic Usage
# Convert to stdout
uvx markitdown input.pdf
# Save to file
uvx markitdown input.pdf -o output.md
uvx markitdown input.docx > output.md
# From stdin
cat input.pdf | uvx markitdown## Supported Formats
- **Documents**: PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls)
- **Web/Data**: HTML, CSV, JSON, XML
- **Media**: Images (EXIF + OCR), Audio (EXIF + transcription)
- **Other**: ZIP (iterates contents), YouTube URLs, EPub
## Options
-o OUTPUT # Output file
-x EXTENSION # Hint file extension (for stdin)
-m MIME_TYPE # Hint MIME type
-c CHARSET # Hint charset (e.g., UTF-8)
-d # Use Azure Document Intelligence
-e ENDPOINT # Document Intelligence endpoint
--use-plugins # Enable 3rd-party plugins
--list-plugins # Show installed plugins## Examples
# Convert Word document
uvx markitdown report.docx -o report.md
# Convert Excel spreadsheet
uvx markitdown data.xlsx > data.md
# Convert PowerPoint presentation
uvx markitdown slides.pptx -o slides.md
# Convert with file type hint (for stdin)
cat document | uvx markitdown -x .pdf > output.md
# Use Azure Document Intelligence for better PDF extraction
uvx markitdown scan.pdf -d -e "https://your-resource.cognitiveservices.azure.com/"## Notes
- Output preserves document structure: headings, tables, lists, links
- First run caches dependencies; subsequent runs are faster
- For complex PDFs with poor extraction, use
-d with Azure Document IntelligenceHow to Use This Skill Unit
Option A: Project-Specific (Recommended)
- Click "Download" above
- In your project, create the directory:
.agent/skills/markdown-converter/ - Save the file as
SKILL.md - The agent will automatically discover the skill based on its description.
Option B: Global Installation (All Agents)
Save the file to these locations to make it available across all projects:
- Claude Code:
~/.claude/skills/intellectronica/agent-skills/markdown-converter/SKILL.md - Cursor:
~/.cursor/skills/intellectronica/agent-skills/markdown-converter/SKILL.md - Antigravity:
~/.gemini/antigravity/skills/intellectronica/agent-skills/markdown-converter/SKILL.md
🚀 Install with CLI:npx skills add intellectronica/agent-skills