<feed xmlns="http://www.w3.org/2005/Atom"> <id>https://morningman.github.io/</id><title>Rayner's Notes</title><subtitle>The personal blog of Mingyu Chen (Rayner), featuring technical deep dives into data systems, open-source development, software architecture, and occasional personal musings.</subtitle> <updated>2026-04-26T00:32:02-07:00</updated> <author> <name>Mingyu Chen(Rayner)</name> <uri>https://morningman.github.io/</uri> </author><link rel="self" type="application/atom+xml" href="https://morningman.github.io/feed.xml"/><link rel="alternate" type="text/html" hreflang="en" href="https://morningman.github.io/"/> <generator uri="https://jekyllrb.com/" version="4.4.1">Jekyll</generator> <rights> © 2026 Mingyu Chen(Rayner) </rights> <icon>/assets/img/favicons/favicon.ico</icon> <logo>/assets/img/favicons/favicon-96x96.png</logo> <entry><title>How We Built Production Vector Search Inside an Analytical Database: 900 QPS at 97% Recall</title><link href="https://morningman.github.io/posts/doris-41-vector-index/" rel="alternate" type="text/html" title="How We Built Production Vector Search Inside an Analytical Database: 900 QPS at 97% Recall" /><published>2026-04-24T10:00:00-07:00</published> <updated>2026-04-24T17:55:04-07:00</updated> <id>https://morningman.github.io/posts/doris-41-vector-index/</id> <content type="text/html" src="https://morningman.github.io/posts/doris-41-vector-index/" /> <author> <name>Mingyu Chen(Rayner)</name> </author> <category term="Data Engineering" /> <category term="Apache Doris" /> <summary>Apache Doris 4.1 adds native vector search: IVF and disk-tiered SPANN indexes, PQ quantization, and hybrid BM25+vector queries planned in a single SQL.</summary> </entry> <entry><title>Integrating Lance into Doris: Notes from the Rust Side</title><link href="https://morningman.github.io/posts/doris-lance-rust-integration/" rel="alternate" type="text/html" title="Integrating Lance into Doris: Notes from the Rust Side" /><published>2026-04-23T19:00:00-07:00</published> <updated>2026-04-23T21:35:29-07:00</updated> <id>https://morningman.github.io/posts/doris-lance-rust-integration/</id> <content type="text/html" src="https://morningman.github.io/posts/doris-lance-rust-integration/" /> <author> <name>Mingyu Chen(Rayner)</name> </author> <category term="Data Engineering" /> <category term="Apache Doris" /> <summary>How Apache Doris embeds a Rust-based Lance reader in its C++ BE via the Arrow C Data Interface, a single-threaded Tokio runtime, and Corrosion.</summary> </entry> <entry><title>Doris Segment V3 vs Parquet &amp; Lance: Fixing Footer Metadata Explosion</title><link href="https://morningman.github.io/posts/segment-v3-vs-open-formats/" rel="alternate" type="text/html" title="Doris Segment V3 vs Parquet &amp;amp; Lance: Fixing Footer Metadata Explosion" /><published>2026-04-23T15:00:00-07:00</published> <updated>2026-04-24T00:32:08-07:00</updated> <id>https://morningman.github.io/posts/segment-v3-vs-open-formats/</id> <content type="text/html" src="https://morningman.github.io/posts/segment-v3-vs-open-formats/" /> <author> <name>Mingyu Chen(Rayner)</name> </author> <category term="Data Engineering" /> <category term="Apache Doris" /> <summary>How Parquet's Flatbuffer proposal, Lance, and Doris segment v3 each tackle columnar footer explosion when wide tables push metadata from KB into MB.</summary> </entry> <entry><title>Apache Doris 4.1 on Iceberg V3: Running the Full Lakehouse Lifecycle from One SQL Engine</title><link href="https://morningman.github.io/posts/apache-doris-41-iceberg-v3-lakehouse/" rel="alternate" type="text/html" title="Apache Doris 4.1 on Iceberg V3: Running the Full Lakehouse Lifecycle from One SQL Engine" /><published>2026-04-19T10:00:00-07:00</published> <updated>2026-04-23T18:23:19-07:00</updated> <id>https://morningman.github.io/posts/apache-doris-41-iceberg-v3-lakehouse/</id> <content type="text/html" src="https://morningman.github.io/posts/apache-doris-41-iceberg-v3-lakehouse/" /> <author> <name>Mingyu Chen(Rayner)</name> </author> <category term="Data Engineering" /> <category term="Apache Iceberg" /> <summary>Apache Doris 4.1 adds Iceberg V3 support: run UPDATE, DELETE, and MERGE INTO on Iceberg tables with Deletion Vectors and Row Lineage, all from SQL.</summary> </entry> <entry><title>Beyond JSON: The Evolution of Variant Data Types in Modern Analytics</title><link href="https://morningman.github.io/posts/beyond-json-variant-data-types/" rel="alternate" type="text/html" title="Beyond JSON: The Evolution of Variant Data Types in Modern Analytics" /><published>2026-04-13T17:00:00-07:00</published> <updated>2026-04-23T18:23:19-07:00</updated> <id>https://morningman.github.io/posts/beyond-json-variant-data-types/</id> <content type="text/html" src="https://morningman.github.io/posts/beyond-json-variant-data-types/" /> <author> <name>Mingyu Chen(Rayner)</name> </author> <category term="Data Engineering" /> <category term="Semi-Structured Data" /> <summary>JSON stopped being a supporting format and became core business data. Why analytics engines need Variant types to keep up — and how Snowflake, Doris, and Iceberg are solving it.</summary> </entry> </feed>
