syncfusion-dotnet-smart-data-extraction
Smart Data Extractor — Syncfusion
Overview
Extracts complete document structures from PDFs and images files using the Syncfusion SmartDataExtractor Library. This skill supports one operational mode — generating C# code for the user's project.
Key Capabilities
- Document structure extraction: Identify text elements, images, headers, footers, and tables (including regions, header rows, columns, cell boundaries, and merged cells).
- File format support: Works with PDF documents and common image formats such as JPEG and PNG.
- Table extraction: Specialized capability to extract tabular data.
- Form recognition: Detects and processes structured form data.
- Page-level control: Extract data from specific pages or defined page ranges.
- Confidence threshold: Results are filtered based on a configurable confidence score (0.0–1.0).
Prerequisites
- Install required runtime and library packages from NuGet before running extraction.
More from syncfusion/document-sdk-skills
syncfusion-dotnet-word
Create, edit, format, and convert Word (.docx) documents using Syncfusion DocIO for .NET. Use this skill for Word processing and DOCX automation when the user asks to generate Word files, modify document content, insert tables or images, apply formatting, automate document workflows, or convert Word to PDF using C# code or CSX execution.
66syncfusion-dotnet-pdf
Create, read, edit, secure, sign, and convert PDF documents (.pdf) using Syncfusion PDF Library for .NET. Use this skill for PDF processing and document automation when the user asks to generate PDF files, modify PDF content, add security or signatures, extract text or images, merge or split PDFs, or perform PDF/A conversion using C# code or CSX execution.
52syncfusion-dotnet-powerpoint
Create, edit, format, and convert PowerPoint (.pptx) presentations using Syncfusion Presentation for .NET. Use this skill for PowerPoint processing and presentation automation when the user asks to generate slides, modify PPTX content, insert text or images, build presentations programmatically, or convert PowerPoint to PDF using C# code or CSX execution.
49syncfusion-java-word
Create, edit, and convert Word documents (.docx) using Syncfusion DocIO. Supports to generate java code for the user's project. Use when the user mentions docx, Word processing, document generation, Syncfusion DocIO, or syncfusion java word.
46syncfusion-dotnet-excel
Create, edit, and convert Excel workbooks (.xlsx/.xls) using Syncfusion XlsIO. Supports two modes — generate C# code for the user's project, or execute a temporary CSX script. Use when the user mentions Excel, xlsx, workbook, template markers, Syncfusion XlsIO, or PDF conversion.
45syncfusion-javascript-pdf
Provides comprehensive guidance for implementing the Syncfusion JavaScript PDF library (@syncfusion/ej2-pdf) to create and manipulate PDF documents programmatically across TypeScript, JavaScript, Angular, React, Vue, and ASP.NET platforms. Use this when working with PDF creation, form filling, annotations, document merging/splitting, text or image extraction, or digital signatures.
43