Migrating from LumenWorks CsvReader

This guide helps you migrate from LumenWorks.Framework.IO to Dataplat.Dbatools.Csv.

Why Migrate?

Feature	LumenWorks	Dataplat.Dbatools.Csv
Performance	Baseline	20%+ faster
Buffer size	4KB	64KB (configurable)
Multi-char delimiters	No	Yes (`::`, `\|\|`, etc.)
Compression support	No	GZip, Deflate, Brotli, ZLib
Parallel processing	No	Yes (2-4x on large files)
String interning	No	Yes (reduces GC pressure)
Decompression bomb protection	No	Yes (10GB default limit)
Active maintenance	Limited	Active
.NET 8 optimizations	No	Yes (Span, ArrayPool)

Quick Start

Before (LumenWorks):

using LumenWorks.Framework.IO.Csv;

using (var reader = new CsvReader(new StreamReader("data.csv"), true))
{
    while (reader.ReadNextRecord())
    {
        string name = reader["Name"];
        string value = reader[1];
    }
}

After (Dataplat.Dbatools.Csv):

using Dataplat.Dbatools.Csv.Reader;

using (var reader = new CsvDataReader("data.csv"))
{
    while (reader.Read())
    {
        string name = reader.GetString("Name");
        string value = reader.GetString(1);
    }
}

Namespace Changes

LumenWorks	Dataplat.Dbatools.Csv
`LumenWorks.Framework.IO.Csv`	`Dataplat.Dbatools.Csv.Reader`
`LumenWorks.Framework.IO.Csv.CsvReader`	`Dataplat.Dbatools.Csv.Reader.CsvDataReader`

Constructor Migration

Basic Usage

LumenWorks:

// TextReader with headers
var reader = new CsvReader(textReader, hasHeaders: true);

// With delimiter
var reader = new CsvReader(textReader, true, delimiter: ';');

// Full constructor
var reader = new CsvReader(textReader, true, ';', '"', '"', '#',
    ValueTrimmingOptions.UnquotedOnly, 4096, null);

Dataplat.Dbatools.Csv:

// File path (simplest - handles streams automatically)
var reader = new CsvDataReader("data.csv");

// TextReader
var reader = new CsvDataReader(textReader);

// With options
var options = new CsvReaderOptions
{
    HasHeaderRow = true,
    Delimiter = ";",
    Quote = '"',
    Escape = '"',
    Comment = '#',
    TrimmingOptions = ValueTrimmingOptions.UnquotedOnly,
    BufferSize = 65536
};
var reader = new CsvDataReader("data.csv", options);

Property Mapping

LumenWorks Property	Dataplat.Dbatools.Csv Equivalent
`Delimiter` (char)	`CsvReaderOptions.Delimiter` (string - supports multi-char)
`Quote`	`CsvReaderOptions.Quote`
`Escape`	`CsvReaderOptions.Escape`
`Comment`	`CsvReaderOptions.Comment`
`HasHeaders`	`CsvReaderOptions.HasHeaderRow`
`FieldCount`	`CsvDataReader.FieldCount`
`CurrentRecordIndex`	`CsvDataReader.CurrentRecordIndex`
`TrimmingOption`	`CsvReaderOptions.TrimmingOptions`
`SupportsMultiline`	`CsvReaderOptions.AllowMultilineFields`
`SkipEmptyLines`	`CsvReaderOptions.SkipEmptyLines`
`NullValue`	`CsvReaderOptions.NullValue`
`BufferSize`	`CsvReaderOptions.BufferSize`
`MaxQuotedFieldLength`	`CsvReaderOptions.MaxQuotedFieldLength`
`UseColumnDefaults`	`CsvReaderOptions.UseColumnDefaults`
`DefaultHeaderName`	Auto-generated as `Column0`, `Column1`, etc.

Method Mapping

Reading Records

LumenWorks	Dataplat.Dbatools.Csv
`ReadNextRecord()`	`Read()`
`reader[0]` (indexer)	`GetString(0)` or `GetValue(0)`
`reader["Name"]` (indexer)	`GetString("Name")` or `reader["Name"]`
`GetFieldHeaders()`	`GetName(i)` for each field, or loop `FieldCount`
`GetFieldIndex("Name")`	`GetOrdinal("Name")`
`HasHeader("Name")`	`HasColumn("Name")`
`CopyCurrentRecordTo(array)`	`GetValues(array)`
`MoveTo(recordIndex)`	Not supported (forward-only)

Type Conversion

LumenWorks returns strings; you must convert manually. Dataplat.Dbatools.Csv has built-in type conversion:

LumenWorks:

int value = int.Parse(reader["Count"]);
DateTime date = DateTime.Parse(reader["Date"]);

Dataplat.Dbatools.Csv:

// Set column types upfront
reader.SetColumnType("Count", typeof(int));
reader.SetColumnType("Date", typeof(DateTime));

// Or via options
var options = new CsvReaderOptions
{
    ColumnTypes = new Dictionary<string, Type>
    {
        ["Count"] = typeof(int),
        ["Date"] = typeof(DateTime)
    }
};

// Then use typed getters
int count = reader.GetInt32("Count");
DateTime date = reader.GetDateTime("Date");

Error Handling Migration

ParseErrorAction

LumenWorks	Dataplat.Dbatools.Csv
`ParseErrorAction.RaiseEvent`	`CsvParseErrorAction.RaiseEvent` + `CollectParseErrors = true`
`ParseErrorAction.AdvanceToNextLine`	`CsvParseErrorAction.AdvanceToNextLine`
`ParseErrorAction.ThrowException`	`CsvParseErrorAction.ThrowException` (default)

LumenWorks:

reader.DefaultParseErrorAction = ParseErrorAction.RaiseEvent;
reader.ParseError += (sender, e) => {
    Console.WriteLine($"Error: {e.Error.Message}");
    e.Action = ParseErrorAction.AdvanceToNextLine;
};

Dataplat.Dbatools.Csv:

var options = new CsvReaderOptions
{
    CollectParseErrors = true,
    MaxParseErrors = 100,
    ParseErrorAction = CsvParseErrorAction.AdvanceToNextLine
};

using var reader = new CsvDataReader("data.csv", options);
while (reader.Read()) { /* process */ }

// After reading, check collected errors
foreach (var error in reader.ParseErrors)
{
    Console.WriteLine($"Row {error.RowIndex}: {error.Message}");
}

MissingFieldAction

LumenWorks	Dataplat.Dbatools.Csv
`MissingFieldAction.ParseError`	`MismatchedFieldAction.ThrowException`
`MissingFieldAction.ReplaceByEmpty`	`MismatchedFieldAction.PadWithNulls` + empty default
`MissingFieldAction.ReplaceByNull`	`MismatchedFieldAction.PadWithNulls`

Note: Dataplat.Dbatools.Csv's MismatchedFieldAction handles both missing and extra fields:

ThrowException - Fail on any mismatch (default)
PadWithNulls - Add nulls for missing fields
TruncateExtra - Ignore extra fields
PadOrTruncate - Handle both cases

Duplicate Header Handling

LumenWorks:

reader.DuplicateHeaderEncountered += (sender, e) => {
    e.HeaderName = $"{e.HeaderName}_{e.Index}";
};

Dataplat.Dbatools.Csv:

var options = new CsvReaderOptions
{
    DuplicateHeaderBehavior = DuplicateHeaderBehavior.Rename // Name, Name_2, Name_3
    // Other options: ThrowException, UseFirstOccurrence, UseLastOccurrence, Ignore
};

New Features (No LumenWorks Equivalent)

Multi-Character Delimiters

var options = new CsvReaderOptions { Delimiter = "::" };
var options = new CsvReaderOptions { Delimiter = "||" };
var options = new CsvReaderOptions { Delimiter = "\t\t" };

Compression Support

// Automatic detection from file extension
using var reader = new CsvDataReader("data.csv.gz");

// Explicit configuration
var options = new CsvReaderOptions
{
    CompressionType = CompressionType.GZip,
    MaxDecompressedSize = 100 * 1024 * 1024 // 100MB limit
};

Parallel Processing

var options = new CsvReaderOptions
{
    EnableParallelProcessing = true,
    MaxDegreeOfParallelism = 4,
    ParallelBatchSize = 100
};

String Interning

var options = new CsvReaderOptions
{
    InternStrings = true,
    CustomInternStrings = new HashSet<string> { "Active", "Inactive", "Pending" }
};

Static Columns

Inject computed columns into every record:

var options = new CsvReaderOptions
{
    StaticColumns = new List<StaticColumn>
    {
        new StaticColumn("ImportDate", DateTime.Now),
        new StaticColumn("SourceFile", "data.csv")
    }
};

Column Filtering

var options = new CsvReaderOptions
{
    IncludeColumns = new HashSet<string> { "Id", "Name", "Email" },
    // Or exclude specific columns:
    ExcludeColumns = new HashSet<string> { "InternalId", "TempField" }
};

Skip Rows (Preamble)

var options = new CsvReaderOptions
{
    SkipRows = 3 // Skip first 3 rows before reading headers
};

Lenient Quote Handling

For malformed CSVs with unmatched quotes or backslash escapes:

var options = new CsvReaderOptions
{
    QuoteMode = QuoteMode.Lenient
};

Null vs Empty String Distinction

// Distinguish between ,, (null) and ,"", (empty string)
var options = new CsvReaderOptions
{
    DistinguishEmptyFromNull = true
};

SqlBulkCopy Integration

Both libraries implement IDataReader, so SqlBulkCopy usage is identical:

using var reader = new CsvDataReader("data.csv");
using var connection = new SqlConnection(connectionString);
connection.Open();

using var bulkCopy = new SqlBulkCopy(connection);
bulkCopy.DestinationTableName = "MyTable";
bulkCopy.WriteToServer(reader);

Complete Migration Example

LumenWorks:

using LumenWorks.Framework.IO.Csv;

using (var textReader = new StreamReader("data.csv"))
using (var reader = new CsvReader(textReader, true, ';', '"', '"', '#',
    ValueTrimmingOptions.UnquotedOnly, 4096, "NULL"))
{
    reader.MissingFieldAction = MissingFieldAction.ReplaceByNull;
    reader.DefaultParseErrorAction = ParseErrorAction.AdvanceToNextLine;

    while (reader.ReadNextRecord())
    {
        string id = reader["Id"];
        string name = reader["Name"];
        int count = int.Parse(reader["Count"]);
        // Process record...
    }
}

Dataplat.Dbatools.Csv:

using Dataplat.Dbatools.Csv.Reader;

var options = new CsvReaderOptions
{
    Delimiter = ";",
    Quote = '"',
    Escape = '"',
    Comment = '#',
    TrimmingOptions = ValueTrimmingOptions.UnquotedOnly,
    BufferSize = 65536, // Larger default for better performance
    NullValue = "NULL",
    MismatchedFieldAction = MismatchedFieldAction.PadWithNulls,
    ParseErrorAction = CsvParseErrorAction.AdvanceToNextLine,
    CollectParseErrors = true,
    ColumnTypes = new Dictionary<string, Type>
    {
        ["Count"] = typeof(int)
    }
};

using (var reader = new CsvDataReader("data.csv", options))
{
    while (reader.Read())
    {
        string id = reader.GetString("Id");
        string name = reader.GetString("Name");
        int count = reader.GetInt32("Count"); // Direct typed access
        // Process record...
    }

    // Check for any errors that occurred
    if (reader.ParseErrors.Any())
    {
        foreach (var error in reader.ParseErrors)
        {
            Console.WriteLine($"Row {error.RowIndex}: {error.Message}");
        }
    }
}

Troubleshooting

"Column not found" errors

LumenWorks is case-sensitive by default. Dataplat.Dbatools.Csv uses case-insensitive column lookup by default. If you have columns with names differing only by case, use GetOrdinal() to get the exact index.

Performance regression

If you see slower performance after migration:

Increase BufferSize (default is already 64KB vs LumenWorks' 4KB)
Enable InternStrings for files with many repeated values
Enable EnableParallelProcessing for files over 100K rows

Different null handling

LumenWorks treats empty fields as empty strings by default. If you need null semantics:

var options = new CsvReaderOptions
{
    DistinguishEmptyFromNull = true
};

Missing MoveTo() functionality

Dataplat.Dbatools.Csv is forward-only for performance. If you need random access:

Read all records into a collection first
Use the collection for random access

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Migrating from LumenWorks CsvReader

Why Migrate?

Quick Start

Namespace Changes

Constructor Migration

Basic Usage

Property Mapping

Method Mapping

Reading Records

Type Conversion

Error Handling Migration

ParseErrorAction

MissingFieldAction

Duplicate Header Handling

New Features (No LumenWorks Equivalent)

Multi-Character Delimiters

Compression Support

Parallel Processing

String Interning

Static Columns

Column Filtering

Skip Rows (Preamble)

Lenient Quote Handling

Null vs Empty String Distinction

SqlBulkCopy Integration

Complete Migration Example

Troubleshooting

"Column not found" errors

Performance regression

Different null handling

Missing MoveTo() functionality

Uh oh!

FilesExpand file tree

MIGRATING-FROM-LUMENWORKS.md

Latest commit

History

MIGRATING-FROM-LUMENWORKS.md

File metadata and controls

Migrating from LumenWorks CsvReader

Why Migrate?

Quick Start

Namespace Changes

Constructor Migration

Basic Usage

Property Mapping

Method Mapping

Reading Records

Type Conversion

Error Handling Migration

ParseErrorAction

MissingFieldAction

Duplicate Header Handling

New Features (No LumenWorks Equivalent)

Multi-Character Delimiters

Compression Support

Parallel Processing

String Interning

Static Columns

Column Filtering

Skip Rows (Preamble)

Lenient Quote Handling

Null vs Empty String Distinction

SqlBulkCopy Integration

Complete Migration Example

Troubleshooting

"Column not found" errors

Performance regression

Different null handling

Missing MoveTo() functionality