C#でPDFファイルを読む

VB C#

using IronPdf;
using IronSoftware.Drawing;
using System.Collections.Generic;

// Extracting Image and Text content from Pdf Documents

// open a 128 bit encrypted PDF
var pdf = PdfDocument.FromFile("encrypted.pdf", "password");

// Get all text to put in a search index
string text = pdf.ExtractAllText();

// Get all Images
var allImages = pdf.ExtractAllImages();

// Or even find the precise text and images for each page in the document
for (var index = 0 ; index < pdf.PageCount ; index++)
{
    int pageNumber = index + 1;
    text = pdf.ExtractTextFromPage(index);
    List<AnyBitmap> images = pdf.ExtractBitmapsFromPage(index);
    //...
}

Imports IronPdf
Imports IronSoftware.Drawing
Imports System.Collections.Generic

' Extracting Image and Text content from Pdf Documents

' open a 128 bit encrypted PDF
Private pdf = PdfDocument.FromFile("encrypted.pdf", "password")

' Get all text to put in a search index
Private text As String = pdf.ExtractAllText()

' Get all Images
Private allImages = pdf.ExtractAllImages()

' Or even find the precise text and images for each page in the document
For index = 0 To pdf.PageCount - 1
	Dim pageNumber As Integer = index + 1
	text = pdf.ExtractTextFromPage(index)
	Dim images As List(Of AnyBitmap) = pdf.ExtractBitmapsFromPage(index)
	'...
Next index

Install-Package IronPdf

C#でPDFファイルを読む

IronPDF C# PDF ユーティリティの PdfDocument.ExtractAllText メソッドは、標準的なPDFテキスト読み取りタスクに最適です。このメソッドは、ソース PDF ドキュメント内の空白やエンコーディングの不一致を問題なく処理します。

PdfDocument.ExtractTextFromPageは、PDFの特定のページからテキストを読み取ります。上記の例では、特定のページ範囲からテキストコンテンツを繰り返し取得するために使用されている様子がわかります。

IronPDFはPDFから生の画像を抽出することもできます。以下の PdfDocument クラスのいずれかのメソッドを使用してください:

ExtractAllImages：PDFに埋め込まれたすべての画像を IronSoftware.Drawing.AnyBitmap オブジェクトとして返します。
ExtractAllRawImages: 埋め込まれたすべての画像を生のバイトのリストとして取得します(バイト[]もちろん、英語のテキストを教えていただけますでしょうか？).
ExtractImagesFromPage: インデックス化されたページに含まれる画像を抽出します。
ExtractImagesFromPages: ExtractImagesFromPage と同様ですが、特定のページ範囲や個々のページのリストから抽出します。
ExtractRawImagesFromPage と ExtractRawImagesFromPages：以前の2つのメソッドと同様に動作しますが、抽出された画像を IronSoftware.Drawing.AnyBitmap オブジェクトとしてではなく、バイト配列として返します。

もちろんです。翻訳先の内容を入力してください。
C#でPDFファイルを読み取る方法
1. C#用IronPDFライブラリのダウンロード
2. PDFから画像またはテキストを抽出する
3. 特定のドキュメントにおける単語の読み取りおよび検索
4. 元のドキュメントからPDF出力を表示