Commit 5f3bcfa

address Copilot feedback
1 parent: ef70e3d · commit: 5f3bcfa


docs/ai/conceptual/data-ingestion.md

Lines changed: 6 additions & 5 deletions
```diff
@@ -5,6 +5,7 @@ author: luisquintanilla
 ms.author: luquinta
 ms.date: 11/11/2025
 ms.topic: concept-article
+ai-usage: ai-assisted
 ---

 # Data Ingestion
```
```diff
@@ -63,7 +64,7 @@ The [Microsoft.Extensions.DataIngestion](https://www.nuget.org/packages/Microsof

 At the foundation of the library is the `IngestionDocument` type, which provides a unified way to represent any file format without losing important information. The `IngestionDocument` is Markdown-centric because large language models work best with Markdown formatting.

-The `IngestionDocumentReader` abstraction handles loading documents from various sources, whether local files or streams. There are few readers available:
+The `IngestionDocumentReader` abstraction handles loading documents from various sources, whether local files or streams. A few readers are available:

 - **[MarkItDown](https://www.nuget.org/packages/Microsoft.Extensions.DataIngestion.MarkItDown)**
 - **[Markdown](https://www.nuget.org/packages/Microsoft.Extensions.DataIngestion.Markdig/)**
```
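For readers of this diff, the paragraph being reworded could translate into usage roughly like the following minimal sketch. Only `IngestionDocumentReader` and `IngestionDocument` appear in the documentation text itself; the concrete `MarkdownReader` type and the `ReadAsync` signature are assumptions about the preview API, not something this commit confirms.

```csharp
// Minimal sketch, assuming a Markdig-based reader from the
// Microsoft.Extensions.DataIngestion.Markdig package referenced above.
// The MarkdownReader type and ReadAsync(string) signature are illustrative
// assumptions, not confirmed by this diff.
using Microsoft.Extensions.DataIngestion;

IngestionDocumentReader reader = new MarkdownReader();           // hypothetical concrete reader
IngestionDocument document = await reader.ReadAsync("notes.md"); // local file or stream

// Per the documentation text, the resulting document is Markdown-centric,
// which is the representation large language models handle best.
Console.WriteLine(document);
```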
```diff
@@ -137,10 +138,10 @@ using VectorStoreWriter<string> writer = new(vectorStore, dimensionCount: 1536);

 The `IngestionPipeline<T>` API allows you to chain together the various data ingestion components into a complete workflow. You can combine:

-- **Readers** to load documents from various sources
-- **Processors** to transform and enrich document content
-- **Chunkers** to break documents into manageable pieces
-- **Writers** to store the final results in your chosen data store
+- **Readers** to load documents from various sources.
+- **Processors** to transform and enrich document content.
+- **Chunkers** to break documents into manageable pieces.
+- **Writers** to store the final results in your chosen data store.

 This pipeline approach reduces boilerplate code and makes it easy to build, test, and maintain complex data ingestion workflows.
```
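A hedged sketch of the chaining this hunk describes follows. The `VectorStoreWriter<string>` construction mirrors the hunk header above; the component variable types, the `IngestionPipeline<string>` constructor shape, and the `ProcessAsync` call are assumptions about the preview API rather than anything the diff shows.

```csharp
// Hedged sketch: chaining all four component kinds from the list above.
// Only the VectorStoreWriter<string> construction comes from the hunk header;
// every other type name and member below is an illustrative assumption.
using Microsoft.Extensions.DataIngestion;

// 'vectorStore' is assumed to be a vector store instance already configured
// elsewhere, as in the snippet the hunk header references.
using VectorStoreWriter<string> writer = new(vectorStore, dimensionCount: 1536);

IngestionDocumentReader reader = new MarkdownReader();       // hypothetical reader
IngestionDocumentProcessor[] processors = [];                // hypothetical enrichment steps
IngestionChunker<string> chunker = new SectionChunker();     // hypothetical chunker

IngestionPipeline<string> pipeline = new(reader, processors, chunker, writer); // assumed constructor
await pipeline.ProcessAsync(["docs/sample.md"]);                               // assumed entry point
```

Whatever the exact signatures turn out to be, the design point from the edited paragraph stands: the pipeline owns the orchestration, so application code shrinks to selecting and configuring components.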
