You could say a lot of things about AWS, but among the cloud platforms (and I've used quite a few), AWS takes the cake. It is logically structured, its documentation is relatively easy to get through, and you have a great variety of tools and services to choose from, both from AWS itself and from third-party developers in its marketplace. There is a learning curve, and there is quite a lot to learn, but it is still way easier than some other platforms. I've used and abused AWS, and EC2 specifically, and for me it is the best.
Based on our records, Amazon AWS seems to be a lot more popular than Apache Parquet. While we know about 463 links to Amazon AWS, we've tracked only 25 mentions of Apache Parquet. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Today, we are entering the Agentic Era. Agentic apps promise to deliver an unprecedented productivity boost, but to do so, they need access to the most sensitive business data: conversations, documents, decisions. Customers do not want to transfer such data to an unknown and untrusted external provider's environment. Instead, they expect these products to run inside their cloud accounts (whether it be AWS, GCP, or... - Source: dev.to / 18 days ago
Create an AWS account and activate it with card and mobile verification. - Source: dev.to / 25 days ago
Anthropic's Claude models, accessible via platforms like AWS Bedrock, complement these by handling long-context tasks effectively. Rajesh Pandey, Principal Engineer at Amazon Web Services, highlights the importance of such foundation models: "OpenAI (via API) and Anthropic Claude (via AWS Bedrock) offer strong general-purpose LLMs with reliable inference." These models are lightweight yet powerful, suitable for... - Source: dev.to / about 2 months ago
Introduction Imagine this: You run a small e-commerce site. It's Black Friday, traffic is flooding in... and your main server suddenly crashes. Normally, this means lost sales, angry customers, and a long night for your IT team. But with Amazon EC2 (Elastic Compute Cloud), your app keeps running because your servers aren't tied to a single machine; they live in the AWS cloud, spread across multiple data... - Source: dev.to / about 2 months ago
If you don't have one yet, sign up at AWS. - Source: dev.to / about 2 months ago
If there was a way to package and compress the Excel spreadsheet in a web-friendly format, then there's nothing stopping us from loading the entire dataset in the browser! Sure enough, the Parquet file format was specifically designed for efficient portability. - Source: dev.to / about 1 month ago
Iceberg decouples storage from compute. That means your data isn't trapped inside one proprietary system. Instead, it lives in open file formats (like Apache Parquet) and is managed by an open, vendor-neutral metadata layer (Apache Iceberg). - Source: dev.to / 6 months ago
Data prep kit github repository: https://github.com/data-prep-kit/data-prep-kit?tab=readme-ov-file Quick start guide: https://github.com/data-prep-kit/data-prep-kit/blob/dev/doc/quick-start/contribute-your-own-transform.md Provided samples and examples: https://github.com/data-prep-kit/data-prep-kit/tree/dev/examples Parquet: https://parquet.apache.org/. - Source: dev.to / 6 months ago
Deliver nice ready-to-use data as duckdb, parquet and csv. - Source: dev.to / 6 months ago
Push the dataset to hugging face in parquet format. - Source: dev.to / 11 months ago
DigitalOcean - Simplifying cloud hosting. Deploy an SSD cloud server in 55 seconds.
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data.
Microsoft Azure - Windows Azure and SQL Azure enable you to build, host and scale applications in Microsoft datacenters.
Apache Spark - Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
Linode - We make it simple to develop, deploy, and scale cloud infrastructure at the best price-to-performance ratio in the market.
DuckDB - DuckDB is an in-process SQL OLAP database management system