Type: Story
Resolution: Done
Priority: Major
Fix Version/s: JIRA-MIgration-Old-Issues
Affects Version/s: None
Component/s: Data Pipeline
Labels:

Epic Link:
Data Scaling
Acceptance Criteria:

* OpenShift on AWS SQL matches using the tag summary tables
* OpenShift on Azure SQL matches using the tag summary tables
* Performance is generally not slower than before this change (we just want to sanity check that we aren't regressing and making things worse)

Feature Link:
COST-10 - New data architecture that includes Data Hub as big data pipeline
Git Pull Request:
https://github.com/project-koku/koku/pull/2271

Sprint:
COST Sprint 49, COST Sprint 50, COST Sprint 51, COST Sprint 52


User Story
As a developer I want OpenShift on Cloud Infrastructure summarization to run efficiently so that we can better handle production workloads.

Assumptions
Right now the summarization SQL pulls apart the daily tables by tag,
e.g. https://github.com/project-koku/koku/blob/master/koku/masu/database/sql/reporting_ocpawscostlineitem_daily_summary.sql#L9-L76

See https://github.com/project-koku/koku/issues/1468

We already track the tag keys and values for each provider type: https://github.com/project-koku/koku/blob/master/koku/masu/database/sql/reporting_awstags_summary.sql

If we do https://github.com/project-koku/koku/issues/1367 and https://github.com/project-koku/koku/issues/1057, then the tag summary tables should be filterable by bill/report period. We can then do a much faster and simpler tag-matching pare-down by joining the OpenShift and infrastructure provider tag summary tables on matching tag key and value.

With the pared-down list of matched fields, we can then filter our starting data sets to include only the pre-matched tag key/values.
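
The two steps above (join the tag summary tables, then filter the starting data) could be sketched roughly as follows. This is an illustration only, not the SQL from the linked pull request: it uses Python's built-in sqlite3 with an in-memory database, and every table and column name (ocp_tags_summary, aws_tags_summary, aws_line_items, report_period_id, cost_entry_bill_id, tag_key, tag_value) is a hypothetical, flattened stand-in for the real koku schema. Matching here is exact on key and value.

```python
import sqlite3

# Hypothetical, simplified schema: one row per (period/bill, tag key, tag value).
MATCHED_CTE = """
    WITH matched AS (
        SELECT DISTINCT ocp.tag_key, ocp.tag_value
        FROM ocp_tags_summary AS ocp
        JOIN aws_tags_summary AS aws
          ON ocp.tag_key = aws.tag_key
         AND ocp.tag_value = aws.tag_value
        WHERE ocp.report_period_id = ?
          AND aws.cost_entry_bill_id = ?
    )
"""


def matched_tag_pairs(conn, ocp_period_id, aws_bill_id):
    """Return the (key, value) pairs present in both tag summary tables."""
    query = MATCHED_CTE + "SELECT tag_key, tag_value FROM matched ORDER BY tag_key, tag_value"
    return conn.execute(query, (ocp_period_id, aws_bill_id)).fetchall()


def matched_line_item_ids(conn, ocp_period_id, aws_bill_id):
    """Filter the starting data set down to rows carrying a pre-matched tag."""
    query = MATCHED_CTE + """
        SELECT li.id
        FROM aws_line_items AS li
        JOIN matched AS m
          ON li.tag_key = m.tag_key
         AND li.tag_value = m.tag_value
        WHERE li.cost_entry_bill_id = ?
        ORDER BY li.id
    """
    rows = conn.execute(query, (ocp_period_id, aws_bill_id, aws_bill_id)).fetchall()
    return [row[0] for row in rows]


def build_demo_db():
    """Create a tiny in-memory database with made-up sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE ocp_tags_summary (report_period_id INTEGER, tag_key TEXT, tag_value TEXT);
        CREATE TABLE aws_tags_summary (cost_entry_bill_id INTEGER, tag_key TEXT, tag_value TEXT);
        CREATE TABLE aws_line_items (
            id INTEGER PRIMARY KEY, cost_entry_bill_id INTEGER,
            tag_key TEXT, tag_value TEXT, cost REAL
        );
    """)
    conn.executemany(
        "INSERT INTO ocp_tags_summary VALUES (?, ?, ?)",
        [(1, "app", "web"), (1, "env", "prod"), (2, "app", "batch")],
    )
    conn.executemany(
        "INSERT INTO aws_tags_summary VALUES (?, ?, ?)",
        [(10, "app", "web"), (10, "env", "dev"), (10, "app", "batch")],
    )
    conn.executemany(
        "INSERT INTO aws_line_items VALUES (?, ?, ?, ?, ?)",
        [(1, 10, "app", "web", 1.5), (2, 10, "env", "dev", 2.0), (3, 10, "app", "batch", 3.0)],
    )
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    print(matched_tag_pairs(conn, 1, 10))
    print(matched_line_item_ids(conn, 1, 10))
```

The point of the sketch is that the join runs over the small, distinct key/value summary rows scoped to one report period and one bill, and only the surviving pairs are used to filter the much larger line item table.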

This operation is currently one of the slowest bottlenecks we could optimize.

blocks

COST-101 Move OpenShift on AWS special tag handling to occur first

Closed

Assignee: Andrew Berglund (Inactive)

Reporter: Jennifer Albertson

Votes: 0

Watchers: 5

Created: 2020/05/13 10:04 AM

Updated: 2024/08/29 4:19 PM
