FyPDF — PDF & file conversion suite · Office

Patch the line between Office and PDF.

Most online converters get the job wrong: they take a Word document and turn it into one giant flat image, or they take an Excel sheet and stitch the cells into one long string. The switchboard does the operator's work. Three lines patched, three handoffs respected: Word to PDF with styles intact, Excel to CSV with columns tabular, HTML to PDF with print rules honored.

  • Layout survives the patch
    Word column structure, Excel cell grids, and HTML print rules map across to PDF without flattening to images or losing the document model.
  • Tables stay tabular
    Excel → CSV emits a real grid: actual columns, header inference, encoding declared. Not stitched text glued together with commas.
  • Hyperlinks stay live
    Outbound URLs and internal cross-references rebuild on the other side: clickable in the PDF, kept as plain, unescaped URLs in the CSV.
  • Outputs are real exchange formats
    Real PDFs that Adobe and Preview open natively. Real CSVs that Excel, Numbers, and pandas read without an import wizard.
Switchboard · 3 lines patched · all lines clear
  • Lock · Word to PDF (.docx → .pdf)
  • Extract · Excel to CSV (.xlsx → .csv)
  • Capture · HTML to PDF (.html → .pdf)
Browser · Cloud-assist

The Routes Ledger

Three lines, every one patched honestly.

Each row names the source format, the target format, the editorial promise, and what stays intact across the patch — written down so the handoff is contractual, not implied.

Routing Manifest · Issue 05 · Office Track · 3 routes patched

Route 01 · Lock · Word to PDF (.docx → .pdf)
  Promise: Lock a Word document into its final layout: paragraph styles, headings, lists, tables, and inline images all map to a real, distribution-ready PDF.
  Preserves: Styles · Outline tree · Images · Hyperlinks · Tables

Route 02 · Extract · Excel to CSV (.xlsx → .csv)
  Promise: Extract Excel sheets into clean CSV with column types inferred, header detection, and UTF-8-with-BOM encoding so any downstream system can read them.
  Preserves: Cell types · Header rows · Per-sheet split · Numeric precision

Route 03 · Capture · HTML to PDF (.html → .pdf)
  Promise: Capture web content with print-aware fidelity: @media print rules respected, page breaks honored, web fonts embedded, and headers/footers per page.
  Preserves: Print CSS · Web fonts · Section anchors · Vector strokes · Image alt text

Lock · best for

Contracts for execution · Manuscripts for distribution · Policy documents for archive · Legal filings

.docx → .pdf
Extract · best for

ETL pipelines · Audit hand-offs · Open-data publishing · Cross-system reporting

.xlsx → .csv
Capture · best for

Release notes · Whitepapers · Reports for archive · Print-ready landing pages

.html → .pdf

The Fidelity Manifesto

What the bridgehouse never garbles in transit.

Four promises stamped onto every line as it leaves the desk. The cheap path is to flatten Word to a JPEG and stitch Excel cells with commas — these are the four guarantees you give up when you take it.

A · Bureau Promise · 01

Layouts survive the patch

FyPDF's Office track maps document models, not pixels. Word's column structure, Excel's cell grids, and HTML's print rules all translate to PDF/CSV at the structure level — fonts re-embed, columns re-flow only where the source asked them to, and pagination follows the source document's intent.

Spec: Document-model translation · Embedded fonts retained · @media print honored · Paragraph styles preserved.

B · Bureau Promise · 02

Tables stay tabular

Excel → CSV emits a real grid. Header rows are detected, numeric columns come back typed, merged regions are flagged before save, and the encoding is declared so Excel-on-Windows opens the result without an import wizard. Not stitched text glued together with commas.

Spec: Header inference · UTF-8 + BOM · ISO date format · Quoted strings · Per-sheet files.

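What that spec line means for the bytes on disk can be sketched with Python's standard csv module. This is an illustration under stated assumptions, not FyPDF's own code: the function name write_csv_bytes and the sample row are hypothetical, and the quoting follows the common RFC 4180 conventions (CRLF line ends, fields quoted only when they contain a comma, quote, or line break).

```python
import csv
import io
from datetime import date


def write_csv_bytes(header, rows):
    """Serialize rows to CSV bytes: UTF-8 with BOM, CRLF line ends, minimal quoting."""
    buf = io.StringIO(newline="")
    writer = csv.writer(buf, quoting=csv.QUOTE_MINIMAL, lineterminator="\r\n")
    writer.writerow(header)
    for row in rows:
        # Dates go out in ISO 8601 form (YYYY-MM-DD); other values pass through.
        writer.writerow([v.isoformat() if isinstance(v, date) else v for v in row])
    # "utf-8-sig" prepends the byte-order mark (EF BB BF) that Excel on Windows looks for.
    return buf.getvalue().encode("utf-8-sig")


# Hypothetical sample row: the note contains a comma and quotes, so it must be quoted.
data = write_csv_bytes(
    ["Date", "Vendor", "Note"],
    [[date(2025, 4, 1), "Acme", 'Paid, "net 30"']],
)

# Reading it back with the same codec drops the BOM and restores the original fields.
rows = list(csv.reader(io.StringIO(data.decode("utf-8-sig"), newline="")))
```

A reader that opens the file as plain "utf-8" instead of "utf-8-sig" would see the BOM glued to the first header name, which is why the encoding is worth declaring rather than leaving to guesswork.
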
C · Bureau Promise · 03

Hyperlinks stay live

Outbound URLs and internal cross-references rebuild on the other side of the patch. PDFs come back with clickable links. CSVs preserve URL columns as URLs, not as escaped strings. Section anchors in HTML translate into PDF outline destinations.

Spec: URL preservation · PDF outline destinations · Cross-reference re-anchoring · No href escaping.

D · Bureau Promise · 04

Real exchange formats out

Outputs are first-class files in their target ecosystems. PDFs open natively in Adobe, Preview, Foxit, and browser readers. CSVs read in Excel, Numbers, pandas, and ETL ingestion without preprocessing. Nothing proprietary, nothing locked.

Spec: PDF/A-compatible · RFC 4180-compliant CSV · No proprietary wrappers · Re-editable downstream.

The Tool Spreads

Read each route as its own datasheet.

What each line keeps intact, what it transforms, and where it pairs back into the rest of the suite. Three spreads, in order.

Spread 01 · DOCX → PDF

Word to PDF

Lock a Word document into a final-form PDF without losing the structure underneath. Paragraph styles, heading hierarchy, numbered/bulleted lists, inline images, hyperlinks, and tables all carry across. Tracked changes can either be accepted on the way out or kept as redline annotations in the resulting PDF.

Preserves
  • Heading hierarchy and paragraph styles
  • Numbered and bulleted list rendering
  • Inline images at original resolution
  • Hyperlinks and bookmarks
  • Tables and cell merges
Transforms
  • Embedded fonts subsetted to in-use glyphs
  • Tracked changes flattened or kept (your choice)
  • Section breaks → PDF page breaks
  • Word footnotes → PDF footer text with anchors
Formats: .docx in · .pdf out · tracked-change handling
Figure · DOCX → PDF · outline (h1 · h2 · h3) · redlines kept (ins · del) · styles · headings · redlines

Spread 02 · XLSX → CSV

Excel to CSV

Extract every sheet of an Excel workbook into clean, encoded CSV. Header rows are detected, numeric and date columns come back typed, and currency symbols are stripped from values but kept in headers. Output is one CSV per sheet with the file naming pattern you choose.

Preserves
  • Header row detection on each sheet
  • Numeric and date column typing
  • Cell-formula evaluated values (not formulas)
  • Per-sheet structure across multi-sheet workbooks
Transforms
  • Output one CSV per sheet, named to match
  • UTF-8 with BOM for Excel-on-Windows compatibility
  • ISO 8601 date format · 2025-04-21 not 4/21/25
  • Currency symbols stripped from values, headers kept
Formats: .xlsx in · .csv × N out · UTF-8 + BOM
Sheet · "Q2 invoices" · A1:D5

  Date        Vendor   Amount  Status
  2025-04-01  Acme     $1,420  Paid
  2025-04-04  Globex   $ 980   Paid
  2025-04-09  Initech  $2,310  Pending
  2025-04-15  Soylent  $ 540   Paid

Output · .csv named for the sheet · header inference · UTF-8 + BOM

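The two value transforms on this route, currency symbols stripped and dates rewritten as ISO 8601, can be sketched in a few lines of Python. This is a rough illustration, not FyPDF's implementation: the helper names strip_currency and to_iso_date are hypothetical, and the sketch assumes US-style input (comma as thousands separator, dot as decimal point, month/day/two-digit-year dates).

```python
import re
from datetime import datetime
from decimal import Decimal


def strip_currency(text):
    """Turn a display value like '$1,420' or '$ 980' into a plain Decimal."""
    # Assumption: anything that is not a digit, dot, or minus sign is formatting.
    cleaned = re.sub(r"[^\d.\-]", "", text)
    return Decimal(cleaned)


def to_iso_date(text, source_format="%m/%d/%y"):
    """Rewrite a short US-style date such as '4/21/25' in ISO 8601 form."""
    return datetime.strptime(text, source_format).date().isoformat()


# The Amount column from the "Q2 invoices" sample sheet.
amounts = [strip_currency(a) for a in ["$1,420", "$ 980", "$2,310", "$ 540"]]
iso = to_iso_date("4/21/25")
```

Decimal is used rather than float so that amounts survive the trip without binary rounding, which matters when the CSV feeds an accounting or audit system.
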
Spread 03 · HTML → PDF

HTML to PDF

Capture web content as print-ready PDF. The renderer respects @media print rules, page-break-before/after CSS, embedded web fonts, and per-page headers/footers from print CSS. Vector strokes stay vector, images stay at source resolution, and links remain clickable in the resulting PDF.

Preserves
  • @media print CSS rules
  • Embedded and Google web fonts
  • Section anchors as PDF outline destinations
  • Vector content (SVG, CSS shapes)
Transforms
  • Per-page header / footer from print CSS
  • Page-break-before / after honored
  • Letter / A4 / custom paper size
  • Optional 'no-script' render for archive captures
Formats: .html in · .pdf out · Letter · A4 · custom
page.html

  <article>
    <h1>Release Notes</h1>
    <style media="print">
      .cover { page-break-after: always; }
      @page { margin: 24mm; }
    </style>
    <section class="cover"></section>
  </article>

Print preview · .pdf · A4 · p. 1 · page break · v2.0 · printed
@media print · page-break · web fonts

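To make "section anchors as PDF outline destinations" concrete, here is a small Python sketch of the first half of that mapping: walking an HTML page and collecting the headings that carry an id, with their nesting level. It is an illustration only. The class name OutlineCollector and the sample markup are hypothetical, it says nothing about how FyPDF's renderer works internally, and turning the entries into actual PDF outline items is left out.

```python
from html.parser import HTMLParser


class OutlineCollector(HTMLParser):
    """Collect (level, anchor id, title) for h1-h3 headings that carry an id."""

    def __init__(self):
        super().__init__()
        self.entries = []
        self._open = None  # (level, id) of the heading currently being read
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2", "h3"):
            anchor = dict(attrs).get("id")
            if anchor:  # headings without an id have nothing to link to
                self._open = (int(tag[1]), anchor)
                self._text = []

    def handle_data(self, data):
        if self._open:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._open and tag == f"h{self._open[0]}":
            level, anchor = self._open
            self.entries.append((level, anchor, "".join(self._text).strip()))
            self._open = None


# Hypothetical sample markup: two anchored headings and one without an id.
collector = OutlineCollector()
collector.feed(
    '<h1 id="notes">Release Notes</h1>'
    '<h2 id="fixes">Bug <em>fixes</em></h2>'
    '<h2>No anchor</h2>'
)
collector.close()
outline = collector.entries
```

Each entry pairs a heading level with the anchor a link would target, which is the information an outline entry needs: where it sits in the tree and where it jumps to.
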
Before · After

What survives the patch from one side to the other.

Three real handoffs that come across the bridgehouse. Source on the left, target on the right, the route stamp in the middle.

Slip · 001 · From DOCX · To PDF

Source · received
Counsel's contract draft

Word file with active tracked changes, eight numbered headings, two embedded vendor logos, and a redline column down the side margin.

Route stamp · Word to PDF · DOCX → PDF

Target · patched
Distribution PDF

Same heading hierarchy, redlines flattened to a stamped layer, vendor logos embedded at source resolution, hyperlinks live to the addenda.

Takeaway: Goes out for execution as a real PDF, with structure intact and redline history kept on a separate layer.

Slip · 002 · From XLSX · To CSV

Source · received
Quarterly revenue model

Twelve sheets, formulas referencing across sheets, currency-formatted columns, mixed date formats, three macros.

Route stamp · Excel to CSV · XLSX → CSV

Target · patched
Twelve CSVs · UTF-8 + BOM

One file per sheet named to match. Cell values evaluated, dates ISO-formatted, currency stripped from values but retained in headers.

Takeaway: Flows cleanly into pandas, Excel-on-Windows, or an ETL pipeline, with no import wizard required.

Slip · 003 · From HTML · To PDF

Source · received
Web release notes page

Long-form HTML with print stylesheet, embedded code samples, anchored section TOC, and a 'last updated' footer.

Route stamp · HTML to PDF · HTML → PDF

Target · patched
Archive-ready PDF

Print stylesheet applied, section anchors mapped to PDF outline, code samples kept in an embedded monospace font, per-page footer carrying the version stamp.

Takeaway: Goes into the artifact bucket alongside the build: the same notes the customer reads on the web, in a sharable file.

Who works the bridgehouse

Five regulars at the operator's desk.

The personas who reach for Office routes weekly — and the specific handoffs they make. Find the closest match to the file on your desk this morning.

Persona · 01 · The contracts admin

Word draft Friday, executed PDF Monday

Contracts arrive from outside counsel as Word files, get redlined, and need to land as PDFs for execution and archive. Tracked-change history can't be lost; the executed copy must be a real PDF: clickable, indexable, signable.

Reaches for
  • Word to PDF · DOCX → PDF · redlines flattened, hyperlinks live
  • Excel to CSV · Schedule of fees → CSV for the AP system
Persona · 02 · The finance analyst

Workbook on a desk, CSVs in a pipeline

Quarterly closes live in Excel — formulas, cross-sheet references, currency formatting, three macros. The data team needs CSVs to ingest. Encoding has to be right the first time so Excel-on-Windows and the Linux pipeline both read them cleanly.

Reaches for
  • Excel to CSV · 12 sheets → 12 CSVs · UTF-8 + BOM
  • Word to PDF · Quarterly memo → PDF for the deck
Persona · 03 · The content publisher

Whitepaper on the web, whitepaper as PDF

Long-form content lives on the marketing site for SEO. Sales wants a sharable PDF that mirrors the live page — same fonts, same charts, same anchored TOC, with a version stamp on every footer.

Reaches for
  • HTML to PDF · Whitepaper page → PDF · @media print
  • Word to PDF · Author drafts → distribution PDFs
Persona · 04 · The release manager

Release notes in the build artifact bucket

Every release publishes notes on the docs site. The artifact pipeline wants the same notes as a PDF in the build bucket — same content, archived alongside the binaries, downloadable for air-gapped deployments.

Reaches for
  • HTML to PDF · Release notes page → PDF · archive bucket
  • Excel to CSV · Compatibility matrix → CSV for downloads
Persona · 05 · The HR lead

Policy in Word, distribution as PDF

Policies live in Word for editing — comments, redlines, version comparison. Once approved, they go out as PDFs to every employee inbox and the LMS attachment slot. Hyperlinks to glossary entries have to survive.

Reaches for
  • Word to PDF · Approved policy → PDF · links live
  • HTML to PDF · Intranet handbook → printable PDF
Common Questions

Before you patch any line, a few honest answers.

Question Index
Q01 · 01 / 07

Will my Word styles map to PDF correctly?

Yes — heading hierarchy, paragraph styles, list rendering, and inline images all carry across. FyPDF translates the document model, not pixels: Heading 1 in Word becomes a real PDF outline level 1, numbered lists keep their numbering, and tracked changes can be flattened to a stamped layer or kept as redline annotations on save.
Bridgehouse Reference · 01
7 questions in the bridgehouse FAQ · Issue 05 · Office
Patch the line

The board is lit. Tell it which line to patch.

Drop the file, pick a route, take the result. Word styles survive, Excel cells stay tabular, HTML print rules are honored — three lines patched, three honest handoffs.

Patch dispatch · 3 routes
Issue 05
All lines · in service · in-browser default
One Suite · Seven Tracks · Twenty-eight Tools and Counting