Abstract
A system can accomplish an objective specified in temporal logic while interacting with an unknown, dynamic but rule-governed environment, by employing grammatical inference and adapting its plan of action on-line. The purposeful interaction of the system with its unknown environment can be described by a deterministic two-player zero-sum game. Using special new product operations, the whole game can be expressed with a factored, modular representation. This representation not only offers computational benefits but also isolates the unknown behavior of the dynamic environment in a particular subsystem, which then becomes the target of learning. As the fidelity of the identified environment model increases, the strategy synthesized based on the learned hypothesis converges in finite time to the one that satisfies the task specification.
| Original language | English |
|---|---|
| Pages (from-to) | 378-391 |
| Number of pages | 14 |
| Journal | Engineering Applications of Artificial Intelligence |
| Volume | 37 |
| DOIs | |
| State | Published - Jan 1 2015 |
Keywords
- Adaptive systems
- Grammatical inference
- Temporal logic control
Fingerprint
Dive into the research topics of 'Symbolic planning and control using game theory and grammatical inference'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver