Software Alternatives, Accelerators & Startups

How To Start Your Next Data Engineering Project

Apache Spark, Apache Druid, Delta Lake, D3.js, Google BigQuery, Amazon S3, Amazon AWS, Apache Airflow
  1. Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
    Pricing:
    • Open Source

    #Databases #Big Data #Big Data Analytics 70 social mentions

  2. Apache Druid is a fast, column-oriented, distributed data store.
    Pricing:
    • Open Source

    #Databases #Data Analysis #Relational Databases 10 social mentions

  3. Delta Lake (categories: Application and Data, Data Stores, and Big Data Tools)
    Pricing:
    • Open Source

    #Office & Productivity #Development #Data Dashboard 35 social mentions

  4. D3.js is a JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG, and CSS.
    Pricing:
    • Open Source

    #Javascript UI Libraries #Charting Libraries #Data Visualization 167 social mentions

  5. Google BigQuery is a fully managed data warehouse for large-scale data analytics.
    Pricing:
    • Paid
    To extend that idea, track down articles related to that swing and post those for discussion. There is definite value in that data, and it is fairly simple to do: use Cloud Composer to ingest the data, store it in a data warehouse such as BigQuery or Snowflake, and build a Twitter bot, orchestrated with something like Airflow, that posts the outputs to Twitter.

    #Data Management #Data Warehousing #Data Dashboard 42 social mentions

  6. Amazon S3 is an object storage service where users can store business data on a secure, cloud-based platform. Amazon S3 operates in 54 availability zones within 18 geographic regions and 1 local region.

    #Cloud Hosting #Object Storage #Cloud Storage 198 social mentions

  7. Amazon Web Services offers reliable, scalable, and inexpensive cloud computing services. Free to join, pay only for what you use.

    #Cloud Computing #Cloud Infrastructure #IaaS 446 social mentions

  8. Airflow is a platform to programmatically author, schedule, and monitor data pipelines.
    Pricing:
    • Open Source

    #Workflows #Workflow Automation #Data Pipelines 75 social mentions
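
The project idea quoted under item 5 (ingest data, store it in a warehouse, post the outputs) can be sketched roughly as below. This is only an illustrative outline under several assumptions: the sample rows, the table name `observations`, and all function names are hypothetical; `sqlite3` stands in for a warehouse such as BigQuery or Snowflake; plain function calls stand in for scheduled Airflow or Cloud Composer tasks; and the "post" step only builds the message text rather than calling any Twitter API.

```python
import sqlite3

# Hypothetical sample records; a real pipeline would pull these from an external source.
SAMPLE_ROWS = [
    ("alpha", 10.0, 12.5),
    ("beta", 8.0, 7.0),
    ("gamma", 20.0, 14.0),
]


def ingest():
    """Stand-in for the ingestion task: return (name, old_value, new_value) rows."""
    return list(SAMPLE_ROWS)


def store(conn, rows):
    """Stand-in for the warehouse load; sqlite3 replaces BigQuery or Snowflake here."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS observations "
        "(name TEXT, old_value REAL, new_value REAL)"
    )
    conn.executemany("INSERT INTO observations VALUES (?, ?, ?)", rows)
    conn.commit()


def biggest_swing(conn):
    """Return (name, delta) for the row with the largest absolute change."""
    return conn.execute(
        "SELECT name, new_value - old_value AS delta FROM observations "
        "ORDER BY ABS(new_value - old_value) DESC LIMIT 1"
    ).fetchone()


def format_post(name, delta):
    """Build the text a bot would post; no network call is made in this sketch."""
    return f"Biggest swing: {name} ({delta:+.1f})"


def run_pipeline():
    """Run the three steps in order, as a scheduler would run dependent tasks."""
    conn = sqlite3.connect(":memory:")
    try:
        store(conn, ingest())
        name, delta = biggest_swing(conn)
        return format_post(name, delta)
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_pipeline())
```

In a scheduled setup, each of the three functions would typically become its own task so that a failed load or post can be retried without re-running ingestion.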
