Cost Management / COST-401

S3 Big Data Pipeline

    • Type: Epic
    • Priority: Major
    • Resolution: Done
    • Fix Version: 2021Q1
    • Epic Name: S3 Big Data Pipeline
    • Epic Status: Done
    • Parent: COST-10 - New data architecture that includes Data Hub as big data pipeline
    • Progress: 0% To Do, 0% In Progress, 100% Done

      User Story

      As developers, we want cost management data stored in S3 and processed using a big data tool, so that we can retain data for longer periods and process large amounts of data more efficiently.
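
      To make the target architecture concrete, below is a minimal sketch of the intended read path, assuming a Presto coordinator service and a hypothetical Hive table backed by Parquet files in S3 (the host, catalog, schema, table, and column names are illustrative, not decisions recorded in this epic). It uses the presto-python-client package.

        import prestodb

        # Connect through the Hive connector that fronts the S3 bucket.
        # Host, user, catalog, and schema are assumed values.
        conn = prestodb.dbapi.connect(
            host="presto-coordinator",
            port=8080,
            user="koku",
            catalog="hive",
            schema="cost",
        )
        cur = conn.cursor()

        # Aggregate directly over the S3-backed table; column names are
        # hypothetical.
        cur.execute(
            "SELECT usage_start, sum(unblended_cost) "
            "FROM aws_line_items "
            "GROUP BY usage_start "
            "ORDER BY usage_start"
        )
        for usage_start, total_cost in cur.fetchall():
            print(usage_start, total_cost)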

      Prioritization / Business Case

      • We have already done spike and PoC work toward this objective; this epic aims for a complete solution in which data is processed and stored using S3 and a big data tool
      • Scale to support and process more customers

      Out Of Scope

      • Although this will enable longer-term storage, this epic only puts the infrastructure in place; it does not include surfacing more than 2 months of data in the API/UI

      Impacts

      • API
      • Data Engineering
      • Database

      Related Stories

      • Deploy Presto in OpenShift
      • Run Presto within Docker-Compose
      • Configure Presto + Hive with our S3
      • Dynamically create Presto tables for S3 data (see the first sketch after this list)
      • Trigger summarization on S3 events (see the second sketch after this list)
      • Unit test generation flow
      • Convert AWS Summary SQL to Presto
      • Convert Azure Summary SQL to Presto
      • Convert OpenShift Summary SQL to Presto
      • Convert OpenShift on AWS Summary SQL to Presto
      • Convert OpenShift on Azure Summary SQL to Presto
      • Calculate cost model cost using Presto
      • Load summarized data into Postgres
      • Presto monitoring
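
      As a rough illustration of the "Dynamically create Presto tables for S3 data" and "Convert AWS Summary SQL to Presto" stories, the first sketch below registers an external Hive table over Parquet files in S3 and runs a daily summary aggregation in Presto SQL. The bucket, path, table, and column names are all assumptions for illustration, not the epic's actual schema.

        import prestodb

        conn = prestodb.dbapi.connect(
            host="presto-coordinator", port=8080, user="koku",
            catalog="hive", schema="cost",
        )
        cur = conn.cursor()

        # External Hive tables can be created dynamically per source;
        # external_location points at the S3 prefix holding the files.
        cur.execute("""
            CREATE TABLE IF NOT EXISTS aws_line_items (
                usage_start timestamp,
                usage_account_id varchar,
                product_code varchar,
                unblended_cost double
            )
            WITH (
                external_location = 's3a://cost-mgmt-data/aws/line_items/',
                format = 'PARQUET'
            )
        """)
        cur.fetchall()  # presto-python-client runs queries lazily; drain to execute

        # A Postgres daily-summary query translated to Presto SQL; for an
        # aggregation this simple the two dialects barely differ.
        cur.execute("""
            SELECT date_trunc('day', usage_start) AS usage_day,
                   usage_account_id,
                   product_code,
                   sum(unblended_cost) AS unblended_cost
            FROM aws_line_items
            GROUP BY 1, 2, 3
        """)
        summary_rows = cur.fetchall()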
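
      Likewise, for "Trigger summarization on S3 events" and "Load summarized data into Postgres", one plausible shape is an SQS-backed listener, assuming S3 event notifications are routed to a queue; the queue name, connection string, and target table below are hypothetical, not decisions recorded here.

        import json

        import boto3
        import psycopg2

        sqs = boto3.client("sqs")
        queue_url = sqs.get_queue_url(QueueName="cost-mgmt-s3-events")["QueueUrl"]

        def load_summary(rows):
            # Write summarized rows (e.g. summary_rows from the sketch
            # above) into a Postgres reporting table.
            conn = psycopg2.connect("dbname=koku user=koku host=db")
            with conn, conn.cursor() as cur:
                cur.executemany(
                    "INSERT INTO reporting_daily_summary "
                    "(usage_day, usage_account_id, product_code, unblended_cost) "
                    "VALUES (%s, %s, %s, %s)",
                    rows,
                )
            conn.close()

        while True:
            resp = sqs.receive_message(
                QueueUrl=queue_url, MaxNumberOfMessages=10, WaitTimeSeconds=20
            )
            for msg in resp.get("Messages", []):
                for record in json.loads(msg["Body"]).get("Records", []):
                    key = record["s3"]["object"]["key"]
                    print(f"new object {key}: run summarization, then load")
                    # run the Presto summary for the affected period, then
                    # call load_summary(summary_rows)
                sqs.delete_message(
                    QueueUrl=queue_url, ReceiptHandle=msg["ReceiptHandle"]
                )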

      External Dependencies

      • OpenShift Quota
      • Presto image from the metering team (the version should be synced regularly, as we already do for Python and Django)

      Documentation Requirements

      • We may want to document how we store user data and highlight the steps taken to ensure its security

      Backend Requirements

      • Are there any prerequisites required before working on this epic? No

      QE Requirements

      • Does QE need specific data or tooling to successfully test this epic? Possibly an S3 bucket for ephemeral environments, or just a per-environment path structure within a shared bucket (sketched below)
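
      A per-environment key prefix within one shared bucket would likely be enough; here is a minimal sketch with boto3 (the bucket name and layout are assumptions):

        import boto3

        s3 = boto3.client("s3")
        BUCKET = "cost-mgmt-qe"  # assumed shared QE bucket

        def env_key(env_name, source, filename):
            # e.g. env_key("ephemeral-42", "aws", "report.parquet")
            return f"{env_name}/{source}/{filename}"

        s3.put_object(
            Bucket=BUCKET,
            Key=env_key("ephemeral-42", "aws", "report.parquet"),
            Body=b"...",
        )

        # Tearing an environment down is then a prefix-scoped cleanup.
        resp = s3.list_objects_v2(Bucket=BUCKET, Prefix="ephemeral-42/")
        for obj in resp.get("Contents", []):
            s3.delete_object(Bucket=BUCKET, Key=obj["Key"])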

      Release Criteria

      • Can this epic be released as individual issues/tasks are completed? Partially; the infrastructure can be deployed incrementally, but summarization should switch from PostgreSQL to Presto in a single deployment
      • Can the backend be released without the frontend? Yes
      • Has QE approved this epic? Yes

    • Assignee: Unassigned
    • Reporter: Chris Hambridge (chambrid)
    • Votes: 0
    • Watchers: 8
