[doc] Update proposal document
Update proposal document with description of problem space Draft rules engine description in openstack_integration document
This commit is contained in:
parent
fa8777fd3e
commit
c71f189faf
BIN
doc/images/openstack_cloud_lifecycle.png
Normal file
BIN
doc/images/openstack_cloud_lifecycle.png
Normal file
Binary file not shown.
After Width: | Height: | Size: 13 KiB |
7
doc/images/src/openstack_cloud_lifecycle.txt
Normal file
7
doc/images/src/openstack_cloud_lifecycle.txt
Normal file
@ -0,0 +1,7 @@
|
||||
@startuml
|
||||
|
||||
(*) -right-> [<i>OpenStack Services</i>\nNova, Keystone, Neutron,\nGlance, Heat, Swift] "Deployment"
|
||||
"Deployment" -right-> [<i>OpenStack Deployment</i>\nFuel, TripleO, Devstack] "Operation\nMaintenance"
|
||||
"Operation\nMaintenance" -right-> [<i>DRAGONS?</i>\nTuskar, <b>Rubick</b>] (*)
|
||||
|
||||
@enduml
|
@ -11,21 +11,30 @@ Project Name
|
||||
Overview
|
||||
--------
|
||||
|
||||
The typical OpenStack cloud life cycle consists of 2 phases:
|
||||
|
||||
- initial deployment and
|
||||
- operation maintenance
|
||||
|
||||
OpenStack cloud operators usually rely on deploymnet tools to configure all the
|
||||
platform components correctly and efficiently upfront. However, after initial
|
||||
deployment platform configurations and operational conditions start to change.
|
||||
These changes could break consistency and integration of cloud platform and its
|
||||
components, and ultimately cause cloud service failures of different kinds.
|
||||
platform components correctly and efficiently in *initial deployment* phase.
|
||||
Multiple OpenStack projects cover that area: TripleO/Tuskar, Fuel and Devstack,
|
||||
to name a few.
|
||||
|
||||
However, once you installed and kicked off the cloud, platform configurations
|
||||
and operational conditions begin to change. These changes could break
|
||||
consistency and integration of cloud platform components. Keeping cloud up and
|
||||
running is the essense of *operation maintenance* phase.
|
||||
|
||||
Cloud operator must quickly and efficiently identify and respond to the root
|
||||
cause of such failures. To do so, he must check if his OpenStack configuration
|
||||
is sane and consistent. These checks could be thought of as rules of diagnostic
|
||||
production system.
|
||||
|
||||
Currently OpenStack ecosystem does not provide tools which specifically help to
|
||||
diagnose platform configuration. We propose a project which will help operators
|
||||
to diagnose their OpenStack platform and reduce response time to known and
|
||||
unknown failures.
|
||||
Currently OpenStack ecosystem lacks projects aimed to increase reliability and
|
||||
resilience of the cloud. With this proposal we want to introduce a project which
|
||||
will help operators to diagnose their OpenStack platform, reduce response time
|
||||
to known and unknown failures and effectively support the desired SLA.
|
||||
|
||||
Mission
|
||||
-------
|
||||
|
@ -1,5 +1,5 @@
|
||||
VALIDATOR INTEGRATION WITH OPENSTACK
|
||||
====================================
|
||||
DIAGNOSTICS INTEGRATION WITH OPENSTACK
|
||||
======================================
|
||||
|
||||
--------
|
||||
Overview
|
||||
@ -50,8 +50,29 @@ and inconsistencies.
|
||||
This engine will provide hints and best practices to increase reliability and
|
||||
operational resilience of the cloud.
|
||||
|
||||
Rules engine
|
||||
------------
|
||||
#FIXME: move this part to document rules_engine.rst
|
||||
|
||||
Rules-based approach to diagnostics
|
||||
-----------------------------------
|
||||
|
||||
The consistent configuration across all components is essential to OpenStack
|
||||
cloud operation. If something is wrong with configuration, you as an operator
|
||||
will know this immidiately either from monitoring or clients complaining. But
|
||||
diagnosing the exact problem is always a challenge, given the number of
|
||||
components and configuration options per component.
|
||||
|
||||
You could think about troubleshooting OpenStack as going through some scenarios
|
||||
which can be expressed as sets of rules. Your configuration must comply to all those
|
||||
rules to be operational. On the other hand, if you know rules which your
|
||||
configuration breaks, you can identify incorrect parameters reliably and easy.
|
||||
That is how production rules or diagnostic systems work.
|
||||
|
||||
Example production rule for OpenStack system could be::
|
||||
|
||||
if (condition)parameter) is (value) then (check_parameter_1) must be (value) and
|
||||
(check_parameter_2) must be (value)
|
||||
|
||||
|
||||
|
||||
------------------
|
||||
Integration Points
|
||||
|
@ -0,0 +1,5 @@
|
||||
PRODUCTION RULES ENGINE
|
||||
=======================
|
||||
|
||||
This document describes rules engine used for inspection and diagnostics of
|
||||
OpenStack configuration.
|
Loading…
x
Reference in New Issue
Block a user