Literature on Cloud Capacity Planning
Description
This release captures the state-of-the-art (as of 2020) in cloud capacity planning literature and provides a set of complementary scripts to analyze this literature. The dataset which is central to this release (publications.yaml) maps 57 cloud capacity planning approaches as published in literature to the taxonomy on cloud capacity planning which the authors of this release have proposed. The approaches were gathered with a systematic literature survey process, aggregating multiple common sources and executing a set of automated and manual filtering steps.
Taxonomy
The taxonomy and the process used to derive it is described in detail in the MSc Thesis of Georgios Andreadis at Delft University of Technology (to be published end of August 2020), on cloud capacity planning. We describe the taxonomy here to provide context to the raw data.
The taxonomy divides the process underlying capacity planning systems into the following categories:
- System Model
- Workloads
- Resources
- Model Inputs
- Forecast Model
- Modeling Strategy
- Model Structure
- Decision Support
- Role
- Type of Advice
- Advice Method
For each of these categories, the taxonomy prescribes a set of possible classes (possible instantiations of the category). We list these for each category, below, preceded by its abbrevation as appearing in the dataset:
- System Model
- Workloads
VM: Virtual MachinesDB: DatabasesS: Streaming WorkloadsBD: Big Data FrameworksWS: Web ServiceB: Batch Jobs
- Resources
C: Compute HardwareS: Storage HardwareN: Network HardwareE: Energy Hardware (Storage and Supply)H: Heat Control HardwareV: Virtualized Resources (VM, containers, etc.)
- Model Inputs
H: Historical DataRS: Resource SpecificationsB: (Micro)Benchmarks or Systematic Performance TestsS: SLAsP: Pricing DataLC: Lease ContractsHP: Human Personnel-related Factors
- Workloads
- Forecast Model
- Modeling Strategy
A: AnalyticalS: SimulationE: Real-world Experimentation
- Model Structure
U: Unconditional ExtrapolationW: What-if Scenarios
- Modeling Strategy
- Decision Support
- Role
F: ForecastA: Adaptation Advice
- Type of Advice
N: Number of ResourcesT: Type of ResourcesL: Locality of Resources
- Advice Method
H: HeuristicR: RegressionL: Local SearchSS: Stochastic SearchSP: Stochastic ProgrammingNN: Neural NetworkGT: Game TheoryGA: Genetic AlgorithmNLP: (Non)Linear Programming
- Role
File Structure
This release is structured as follows:
publications.yaml: This is the dataset of mappings of publications to the taxonomy. Each item in the array represents a publication, with a set of true-false classifications per category for each class.- The
idfield of each publication identifies the publication (first-author and publication year). - The
summaryfield of each publication summarizes the publication in a short sentence. - The
classificationfield contains a set of true-false classifications per category for each class. - The
notesfield is an optional field containing any additional notes kept by the author of this dataset on their classification, in the case where doubts arose during the classification process.
- The
taxonomy.py: Script which parses the YAML dataset into different CSV views per category, to facilitate meta-analysis. Also prints out a full (long-table) representation of the mappings.taxonomy_analysis.py: Jupyter notebook which contains several meta-analysis processing steps, including trend, cluster, and correlation analysis.README.md: A file containing this description.
Files
cloud-capacity-planning-literature-1.0.0.zip
Files
(781.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:d3f94520194c97d4ec314feace431e07
|
781.8 kB | Preview Download |