AWS Glue DataBrew now supports the ORC file format as an input

AWS Glue DataBrew customers can now clean and transform data stored in the Optimized Row Columnar (ORC) file format, a widely used format for storing Hive data. When creating a dataset in AWS Glue DataBrew, you can now use ORC files in addition to the already supported Apache Avro, Apache Parquet, Microsoft Excel, CSV, and JSON file formats.
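As a rough illustration of what creating such a dataset might look like from code, here is a minimal sketch using the boto3 `databrew` client. The dataset name, bucket, and key are hypothetical placeholders, and the sketch assumes the `CreateDataset` API accepts `"ORC"` as its `Format` value and that AWS credentials and a region are already configured.

```python
def build_orc_dataset_request(name, bucket, key):
    """Build keyword arguments for a DataBrew CreateDataset call on an ORC file.

    Assumption: "ORC" is the Format value the API expects for ORC input.
    """
    return {
        "Name": name,
        "Format": "ORC",
        "Input": {"S3InputDefinition": {"Bucket": bucket, "Key": key}},
    }


def create_orc_dataset(name, bucket, key):
    """Create the dataset in DataBrew (requires boto3 and AWS credentials)."""
    import boto3  # imported lazily so the request builder works without the SDK

    client = boto3.client("databrew")
    return client.create_dataset(**build_orc_dataset_request(name, bucket, key))


if __name__ == "__main__":
    # Hypothetical names; only prints the request, makes no AWS call.
    print(build_orc_dataset_request("sales-orc", "example-bucket", "data/sales.orc"))
```

Calling `create_orc_dataset(...)` with a real bucket and key would send the request. The builder is kept separate so the parameters can be inspected without touching AWS.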

Post Updated on April 01, 2022 at 11:07PM
