-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Requirements document for on-device testing (#1161) #1283
Merged
+103
−0
Merged
Changes from all commits
Commits
Show all changes
2 commits
Select commit
Hold shift + click to select a range
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,103 @@ | ||
# CHIP on-device testing | ||
|
||
_Requirements and high-level design_ | ||
|
||
## Background | ||
|
||
The ability to run tests on actual and emulated hardware is paramount in | ||
embedded projects. CHIP is no exception. We want on-device testing to be a first | ||
class goal of CHIP architecture. On-device testing requirements apply both to | ||
Continuous Integration testing for main CHIP software stack development and to | ||
eventual CHIP product certification. This document explores the requirements and | ||
evaluates potential solutions. | ||
|
||
## Overview of requirements | ||
|
||
A good device test infrastructure is built on four pillars. | ||
|
||
### Pillar 1: Using a test framework | ||
|
||
A test framework provides a testing structure that developers can follow and | ||
potentially reduces some of the burden of test setup and teardown (less | ||
boilerplate). Support for state-oriented and asynchronous structuring of tests | ||
would be beneficial. Many test frameworks leverage scripting languages such as | ||
Python to simplify the quick development of tests and to leverage rich sets of | ||
libraries for device/systems access and results generation. | ||
|
||
### Pillar 2: Dispatching tests | ||
|
||
Tests can run on lab machines or on the developer's local workstation. Tests can | ||
be triggered manually by the developer or as a result of completion of a | ||
changeset built on a continuous integration (CI) server. CHIP involves multiple | ||
stakeholders, many of which will want to contribute to the testing efforts with | ||
lab capacity. The infrastructure therefore must be prepared for | ||
cross-organization test dispatch. | ||
|
||
To facilitate uniform dispatch of tests we will probably need a simple | ||
request/response protocol. Potentially HTTPS based and RESTful. Due to the long | ||
running nature of device tests the response for a test scheduling request could | ||
be a test ID, not the test result. That ID could be used to query the test | ||
status, subscribe for notifications on status changes and to pull the test | ||
results. Core aspects of such a scheme include the conventions for request | ||
artifacts contents and minimum expected results contents once the run is | ||
complete. | ||
|
||
### Pillar 3: Interacting with devices | ||
|
||
The test host environment has to reset devices, flash images on them, issue | ||
commands, monitor status and collect test results. It may also need to integrate | ||
both virtual (simulated) and real devices together. This can at first be done in | ||
an ad-hoc way per platform but eventually we can go into device access | ||
abstraction, i.e. define a common device testing interface which CHIP-compliant | ||
devices can expose. The test host has to be prepared for driving multiple | ||
devices at the same time for a single test, e.g. for tests that check | ||
communication between multiple devices. | ||
|
||
### Pillar 4: Collecting results | ||
|
||
Ideally, test results are output in standard formats and similar or analogous | ||
results between different devices and tests are output the same way. This | ||
ensures reusability of code that processes similar data while allowing | ||
aggregation of results across different dimensions. Failed tests must propagate | ||
errors from device platform layers all the way to the CHIP stack and present | ||
errors and potential stack traces in a standard result format. As the purpose of | ||
on-device tests is to capture bugs, it is important that the test outputs | ||
highlight the failure reason(s) and developers don't have to browse through | ||
thousands of lines of logs to find the one line that sheds light on why a test | ||
failed. | ||
|
||
## Priorities | ||
|
||
In the spirit of CHIP's charter, it would be great to see something taking-off | ||
as soon as possible, to support continuous testing of the evolving CHIP stack. | ||
We could then improve on that first iteration, even if we have to throw away | ||
some temporary concepts and code. | ||
|
||
Test dispatch (Pillar 2) arises as the highest priority, because all other | ||
pillars can have ad-hoc solutions. The first need is an interface between a | ||
CircleCI job and a test execution host at a participating organization. This | ||
would enable dispatching tests to a variety of existing in-house infrastructure, | ||
while retaining common request/response protocols to shield the CI system from | ||
implementation details of each lab. | ||
|
||
The next most important goal is to provide a test framework (Pillar 1). With a | ||
standard framework developers can start writing tests, even if those tests will | ||
be device specific and of ad-hoc input and output format. The general structure | ||
of tests will however be present and later the tests can be adapted to standard | ||
interactions (Pillar 3) and result formats (Pillar 4). | ||
|
||
Specifying result formats (Pillar 4) for the most common outputs | ||
(success/failure, failure reason, stack trace, memory and CPU usage time series, | ||
pcaps of network traffic, etc.) will be an ongoing effort. The simplest output | ||
formats can be specified together with the test framework. | ||
|
||
Lastly, we want to look into a common device interaction interface that would | ||
enable reusing tests between different devices. | ||
|
||
## Baseline hardware platforms for CHIP | ||
|
||
The TSG is targeting the following platforms/boards for early bringup: | ||
|
||
- Nordic nRF52 board <TODO: REF> | ||
- SiLabs XXXX board <TODO:REF> | ||
- Espressif ESP32 XXXX board <TODO:REF> |
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should always support Linux, macOS with BSD Sockets + LwIP (+ Network.framework for macOS) variants as well, at all times. If possible, we should also add iOS and Android, where it makes sense and where possible.