summaryrefslogtreecommitdiffstats
path: root/roles/openshift_health_checker
Commit message (Collapse)AuthorAgeFilesLines
* etc_traffic check: factor away short_versionLuke Meyer2017-08-152-6/+4
|
* Merge pull request #5036 from ↵Scott Dodson2017-08-154-11/+17
|\ | | | | | | | | Miciah/openshift_checks-support-ovs-2.7-on-ocp-3.5-and-3.6 openshift_checks: allow OVS 2.7 on OCP 3.5 and 3.6
| * openshift_checks: allow OVS 2.7 on OCP 3.5 and 3.6Miciah Masters2017-08-114-11/+17
| | | | | | | | | | | | | | | | | | rpm_version: Allow package_list items to specify a list value for version. If a list value is provided for a package, pass the check if any version in that list is found. ovs_version: Specify both 2.6 and 2.7 as allowed versions of OVS for OpenShift versions 3.5 and 3.6.
* | Merge pull request #4770 from kwoodson/openshift_repos_refactorScott Dodson2017-08-151-2/+1
|\ \ | | | | | | Removing openshift_repo dependencies. Moving to beginning of openshift_cluster.
| * | Removing dependencies for openshift_repos and setting them up early in the ↵Kenny Woodson2017-08-141-2/+1
| | | | | | | | | | | | cluster build.
* | | Merge pull request #4944 from sosiouxme/20170728-refactor-ansible-mountsScott Dodson2017-08-118-88/+73
|\ \ \ | |/ / |/| | openshift_checks: refactor find_ansible_mount
| * | openshift_checks: refactor find_ansible_mountLuke Meyer2017-08-088-88/+73
| |/ | | | | | | Reuse the code for finding the ansible_mounts mount for a path.
* | Merge pull request #4922 from sosiouxme/20170728-improve-get-varsScott Dodson2017-08-093-7/+73
|\ \ | | | | | | openshift_checks: enable variable conversion
| * | openshift_checks: enable variable conversionLuke Meyer2017-08-013-7/+73
| | |
* | | Merge pull request #4812 from kwoodson/openshift_version_refactorOpenShift Bot2017-08-081-0/+3
|\ \ \ | |_|/ |/| | Merged by openshift-bot
| * | Refactor of openshift_version.Kenny Woodson2017-08-081-0/+3
| | |
* | | Merge pull request #4913 from sosiouxme/20170720-refactor-check-resultsRodolfo Carvalho2017-08-0822-643/+730
|\ \ \ | |/ / |/| | openshift_checks: refactor check results
| * | openshift_checks: refactor logging checksLuke Meyer2017-08-0215-616/+699
| | | | | | | | | | | | | | | | | | | | | Turn failure messages into exceptions that tests can look for without depending on text meant for humans. Turn logging_namespace property into a method. Get rid of _exec_oc and just use logging.exec_oc.
| * | openshift_checks: add property to track 'changed'Luke Meyer2017-08-0212-40/+44
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Introduced the 'changed' property for checks that can make changes to track whether they did or not. Rather than the check's own logic having to track this and include it in the result hash, just set the property and have the action plugin insert it in the result hash after running (even if there is an exception). Cleared out a lot of crufty "changed: false" hash entries.
* | | Merge pull request #4739 from kwoodson/openshift_facts_refactorScott Dodson2017-08-081-4/+0
|\ \ \ | | | | | | | | Reduce openshift_facts dependencies.
| * | | integration tests: keep openshift_version happyLuke Meyer2017-08-071-4/+0
| | | |
* | | | Merge pull request #4960 from ↵OpenShift Bot2017-08-072-5/+98
|\ \ \ \ | |/ / / |/| | | | | | | | | | | juanvallejo/jvallejo/verify-disk-memory-before-upgrade-no-flake Merged by openshift-bot
| * | | add pre-flight checks to ugrade pathjuanvallejo2017-08-022-5/+98
| | |/ | |/|
* | | Merge pull request #4969 from sosiouxme/20170801-tolerate-ovs-beyond-36OpenShift Bot2017-08-022-90/+80
|\ \ \ | |_|/ |/| | Merged by openshift-bot
| * | package_version check: tolerate release version 3.7Luke Meyer2017-08-022-90/+80
| |/ | | | | | | | | | | | | | | | | | | | | Addresses issue https://github.com/openshift/openshift-ansible/issues/4967 For now, any version >= 3.6 is handled as if it were 3.6. We may want to keep that or fine-tune it later. Also, the ovs_version check is not updated. This is a post-install health check (does not block install/upgrade) with an update already in progress so will be addressed there.
* | add fluentd logging driver config checkjuanvallejo2017-08-0111-109/+563
|/
* Normalize list of checks passed to action pluginRodolfo Carvalho2017-07-271-5/+15
| | | | | | | | | This allows users to pass either a string with comma-separated check names or a proper list of strings, whatever is more convenient in the context. For consistency, the same normalization is applied to the 'openshift_disable_check' variable.
* Make LoggingCheck.run return the correct typeRodolfo Carvalho2017-07-271-1/+4
| | | | | | The run method is expected to return a dict. Even though we should not run LoggingCheck by itself, it is still possible to do it and without this change we get an unhandled exception.
* openshift_checks: refactor to internalize task_varsLuke Meyer2017-07-2539-480/+439
| | | | | | | | | Move task_vars into instance variable so we don't have to pass it around everywhere. Also store tmp. Make sure both are filled in on execute_module. In the process, is_active became an instance method, and task_vars is basically never used directly outside of test code.
* openshift_checks: get rid of deprecated module_executorLuke Meyer2017-07-254-25/+10
|
* openshift_checks: improve comments/namesLuke Meyer2017-07-2513-51/+57
|
* Merge pull request #4682 from juanvallejo/jvallejo/verify-logging-index-timeRodolfo Carvalho2017-07-244-6/+321
|\ | | | | verify sane log times in logging stack
| * verify sane log times in logging stackjuanvallejo2017-07-204-6/+321
| | | | | | | | | | This patch verifies that logs sent from logging pods can be queried on the Elasticsearch pod within a reasonable amount of time.
* | Merge pull request #4316 from ↵Rodolfo Carvalho2017-07-204-0/+434
|\ \ | | | | | | | | | | | | juanvallejo/jvallejo/add-increased-etcd-traffic-check add check to detect increased etcd traffic
| * | add etcd increased-traffic checkjuanvallejo2017-07-194-0/+434
| | |
* | | openshift_checks/docker_storage: overlay/2 supportLuke Meyer2017-07-192-42/+242
| | | | | | | | | | | | | | | | | | | | | | | | fix bug 1469197 https://bugzilla.redhat.com/show_bug.cgi?id=1469197 When Docker is configured with the overlay or overlay2 storage driver, check that it is supported and usage is below threshold.
* | | Allow OVS 2.7 in latest OpenShift releasesRodolfo Carvalho2017-07-172-32/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Change the package_version check to tolerate either Open vSwitch 2.6 or 2.7. Fixes https://bugzilla.redhat.com/show_bug.cgi?id=1465882 This commit removes a unit test that adds no coverage and tests data instead of logic. This coupling makes every change to supported versions require the same changes to the tests.
* | | Make aos_version module handle multiple versionsRodolfo Carvalho2017-07-172-24/+93
| | | | | | | | | | | | | | | | | | | | | | | | Some packages are supported at more than one major.minor version at the same time. Support is added keeping backward compatibility: the 'version' key can be either a string (single version) or a list of versions.
* | | Split positive and negative unit testsRodolfo Carvalho2017-07-171-50/+40
| | | | | | | | | | | | | | | | | | | | | Split positive and negative tests into their own functions. This means less lines of code, clearer purpose, easier to understand what each test does or doesn't and to add new test cases.
* | | add scheduled pods checkjuanvallejo2017-07-112-2/+32
| | |
* | | Only store failures that were not ignored.Rodolfo Carvalho2017-07-111-1/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | In the past, health checks were implemented with ignore_errors: True in the playbook level, requiring us to store all failures, ignored or not, so that we could report on all failed checks. Now checks are run from a single action plugin entry point, without ignoring errors (all errors are aggregated via the action plugin). Since the integration of the openshift_health_checker role with the install playbook, failure summaries are part of the output of a lot more calls to ansible-playbook. We shall report only failures that caused the execution to stop, as ignored failures in the summary only serve to confuse users.
* | | Add overlay to supported Docker storage driversRodolfo Carvalho2017-07-112-3/+3
| | | | | | | | | | | | | | | | | | | | | Fixes https://bugzilla.redhat.com/show_bug.cgi?id=1467809 As a next step, we can refine under which conditions the overlay driver is supported.
* | | openshift_checks: fix execute_module paramsLuke Meyer2017-07-114-4/+4
| | | | | | | | | | | | | | | | | | | | | | | | Fix where execute_module was being passed task_vars in place of tmp param. Most modules don't seem to use either and so this doesn't fail; but under some conditions (perhaps different per version of ansible?) it tried to treat the dict as a string and came back with a python stack trace.
* | | Merge pull request #4655 from sosiouxme/20170630-atomic-etcd-bz1466622OpenShift Bot2017-06-302-1/+16
|\ \ \ | | | | | | | | Merged by openshift-bot
| * | | docker_image_availability: fix containerized etcdLuke Meyer2017-06-302-1/+16
| | |/ | |/| | | | | | | fixes bug 1466622 - docker_image_availability check on etcd host failed for 'openshift_image_tag' is undefined
* | | Merge pull request #4607 from sosiouxme/20170627-docker-storage-vgs-unitsOpenShift Bot2017-06-301-1/+1
|\ \ \ | | | | | | | | Merged by openshift-bot
| * | | docker_storage check: make vgs return sane outputLuke Meyer2017-06-271-1/+1
| |/ / | | | | | | | | | | | | | | | | | | | | | | | | | | | fix bug 1464974 https://bugzilla.redhat.com/show_bug.cgi?id=1464974 Specify --units on vgs call. In my testing with lvm 2.0.2.171(2) on RHEL Atomic Host 7.4, this turned a response of "<4.07g" into "4.07g" which should resolve the issue. I haven't found what the "<" is for in the first place but I'm thinking this should at least be a safe change.
* | | Merge pull request #4565 from rhcarvalho/handle-incorrect-check-namesOpenShift Bot2017-06-302-4/+13
|\ \ \ | | | | | | | | Merged by openshift-bot
| * | | Capture exceptions when resolving available checksRodolfo Carvalho2017-06-232-4/+13
| |/ / | | | | | | | | | | | | | | | | | | Calling the action plugin (e.g. when running a playbook) with an incorrect check name was raising an unhandled exception, leading to poor output in Ansible (requiring a higher verbosity level to see what is going wrong).
* | | Enable disk check on containerized installsRodolfo Carvalho2017-06-222-15/+11
| | | | | | | | | | | | | | | | | | | | | According to the docs the disk requirements should be similar to non-containerized installs. https://docs.openshift.org/latest/install_config/install/rpm_vs_containerized.html#containerized-storage-requirements
* | | Add module docstringRodolfo Carvalho2017-06-221-1/+2
| | |
* | | Add suggestion to check disk space in any pathRodolfo Carvalho2017-06-221-1/+5
| | |
* | | Require at least 1GB in /usr/bin/local and tempdirRodolfo Carvalho2017-06-222-1/+15
| | | | | | | | | | | | During install, those paths are used and require some free space.
* | | Refactor DiskAvailability for arbitrary pathsRodolfo Carvalho2017-06-222-34/+64
|/ / | | | | | | Prepare the check to support verifying multiple paths, not only /var.
* | Disable TLS verification in skopeo inspectRodolfo Carvalho2017-06-191-1/+1
| | | | | | | | | | | | | | Some registries are not configured with valid certificates and thus the check fails with 'http: server gave HTTP response to HTTPS client'. Since this is not fetching images, but only checking for existence, trade security for convenience.