The Blog

rss feed

...

Parquet.Net 3.2.0 Released

Parquet 3.2.0 is released which marks a new stage in powerful capabilities of serializing C# classes to parquet files. Serialization is one of the original Parquet.Net features no one else amongst other parquet implementation supp ...

Read more
...

What's new coming to Parquet 3.1.2

v3.1.2 will be the next minor release of Apache Parquet for .NET and is mostly around improving row-based utilities. It's also launches the first steps towards integrating this library with JSON, specifically Table and Row classes' .ToString() method by default will now produce a multiline J ...

Read more
...

What's wrong with Parquet.Net v2

Apache Parquet for .NET has come a long long way since the original idea in June 2017 (the first commit backdates to June 5). V1 ...

Read more
...

Reading and Writing Parquet Files in Different Languages

Python In python, the easiest option is to use fastparquet package. If you're using conda simply type: conda install fastparquet Fastparquet is an amazing python implementation and is my personal favorite. It's ease ...

Read more
...

What's coming in Parquet.Net 3.1

Parquet.Net is about to be released in the following few days. Since v3.0 was pushed to the public, it saw a lot of interest and appraisal for it's incredible performance boost, however there were problems as well. To reiterate, v3.0 was a comple ...

Read more
...

Apache Parquet on .NET

Preamble If you are in Big Data, you know about Apache Parquet format. It's a de facto standard for storing enormous amounts of data on big data processing clusters like Apache Spark. What's making it so good is all data stored as bi ...

Read more