
add run_pipeline example

Former-commit-id: 9a6a89962c
master
lhenry15 5 years ago
parent
commit
cf2b170182
8 changed files with 74 additions and 99 deletions
  1. +2
    -2
      d3m/d3m/index.py
  2. +39
    -0
      docs/\
  3. +2
    -2
      docs/source/getting_started.rst
  4. BIN
      docs/source/img/tods_framework.pdf
  5. +25
    -84
      docs/source/overview.rst
  6. +5
    -5
      examples/build_IsolationForest_pipline.py
  7. +0
    -6
      examples/build_LODA_pipline.py
  8. +1
    -0
      examples/evaluate_default_pipeline.py

+ 2
- 2
d3m/d3m/index.py View File

@@ -107,8 +107,8 @@ def get_primitive(primitive_path: str) -> typing.Type[base.PrimitiveBase]:
    if not primitive_path:
        raise exceptions.InvalidArgumentValueError("Primitive path is required.")

    if not primitive_path.startswith('d3m.primitives.'):
        raise exceptions.InvalidArgumentValueError("Primitive path does not start with \"d3m.primitives\".")
    #if not primitive_path.startswith('d3m.primitives.'):
    #    raise exceptions.InvalidArgumentValueError("Primitive path does not start with \"d3m.primitives\".")

    path, name = primitive_path.rsplit('.', 1)

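With the prefix check commented out, the surviving logic reduces to splitting the dotted path. A standalone sketch of that behavior (`split_primitive_path` is a hypothetical helper for illustration, not part of the d3m API):

```python
def split_primitive_path(primitive_path: str):
    """Split a dotted primitive path into (module path, class name)."""
    if not primitive_path:
        raise ValueError("Primitive path is required.")
    # The 'd3m.primitives.' prefix check is relaxed in this commit, so any
    # dotted path is accepted and split on the last dot.
    path, name = primitive_path.rsplit('.', 1)
    return path, name

# e.g. split_primitive_path('d3m.primitives.a.b.C') -> ('d3m.primitives.a.b', 'C')
```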


+ 39
- 0
docs/\ View File

@@ -0,0 +1,39 @@
Overview
========

Design Principles
~~~~~~~~~~~~~~~~~

TODS follows the design principle of `D3M <http://datadrivendiscovery.org/>`_.
The toolkit wraps each function into a ``Primitive`` class with unified
interfaces. The goal of this toolkit is to enable users to easily develop
outlier detection systems for time series data. The following design principles
are applied when developing the toolkit:
* **Reproducible.** Results on the environments can be reproduced. The same result should be obtained with the same random seed in different runs.
* **Accessible.** The experiences are collected and well organized after each game with easy-to-use interfaces. Users can conveniently configure state representation, action encoding, reward design, or even the game rules.
* **Scalable.** New card environments can be added conveniently into the toolkit with the above design principles. We also try to minimize the dependencies in the toolkit so that the codes can be easily maintained.

TODS High-level Design
~~~~~~~~~~~~~~~~~~~~~~~~


.. image:: img/framework.pdf
:width: 800



Data Processing
---------------


Timeseries Processing
---------------------

Feature Analysis
----------------

Detection Algorithms
---------------------

Reinforcement
--------------

+ 2
- 2
docs/source/getting_started.rst View File

@@ -5,8 +5,8 @@ In this document, we provide some toy examples for getting started. All
the examples in this document and even more examples are available in
`examples/ <https://github.com/datamllab/rlcard/tree/master/examples>`__.

Playing with Random Agents
--------------------------
Constructing Point-wise Detection on NAB Dataset
------------------------------------------------

We have set up a random agent that can play randomly on each
environment. An example of applying a random agent on Blackjack is as


BIN
docs/source/img/framework.pdf → docs/source/img/tods_framework.pdf View File


+ 25
- 84
docs/source/overview.rst View File

@@ -1,101 +1,42 @@
Overview
========

Design Principles
~~~~~~~~~~~~~~~~~

The toolkit wraps each game by ``Env`` class with easy-to-use
interfaces. The goal of this toolkit is to enable the users to focus on
algorithm development without caring about the environment. The
following design principles are applied when developing the toolkit:
* **Reproducible.** Results on the environments can be reproduced. The same result should be obtained with the same random seed in different runs.
* **Accessible.** The experiences are collected and well organized after each game with easy-to-use interfaces. Users can conveniently configure state representation, action encoding, reward design, or even the game rules.
* **Scalable.** New card environments can be added conveniently into the toolkit with the above design principles. We also try to minimize the dependencies in the toolkit so that the codes can be easily maintained.
TODS follows the design principle of `D3M <http://datadrivendiscovery.org/>`_.
The toolkit wraps each function into a ``Primitive`` class with a unified
interface for various functionalities. The goal of this toolkit is to enable
users to easily develop outlier detection systems for multivariate time series data.
TODS covers three pervasive outlier scenarios for the given time series data.
* **Point-wise Outliers** are outliers that occur at individual time points. In other words, each time point in the time series data could be an outlier.
* **Pattern-wise Outliers** refer to the scenario where each outlier is defined as a subsequence. It is also known as collective outlier detection.
* **System-wise Outliers** are defined over a set of time series. For example, each set of time series may represent a device (system) equipped with multiple sensors, where each sensor is represented as a univariate time series. The goal is to distinguish the anomalous devices from the normal ones.
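The point-wise scenario can be illustrated with a toy labeler that assigns one binary label per time point (the function and threshold below are invented for illustration and are not TODS primitives):

```python
# Hypothetical sketch of point-wise outlier labeling on a univariate series:
# each time point receives a binary label (1 = outlier), as in the first scenario.
series = [0.1, 0.2, 0.1, 9.5, 0.2, 0.1, 8.7, 0.3]

def label_pointwise(values, threshold=3.0):
    """Mark every point whose value exceeds the threshold as an outlier."""
    return [1 if v > threshold else 0 for v in values]

labels = label_pointwise(series)  # one label per time point
```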

TODS High-level Design
~~~~~~~~~~~~~~~~~~~~~~~~
Following the typical machine learning pipeline, TODS consists of the following modules: data processing, time series processing, feature analysis, detection algorithms, and a reinforcement module.

This document introduces the high-level design for the environments, the
games, and the agents (algorithms).

.. image:: img/framework.pdf
.. image:: img/tods_framework.pdf
:width: 800



Data-Processing
Data Processing
---------------
Data processing handles data in a tabular fashion. Its functionalities include dataset loading, data filtering, data validation, data binarization, and timestamp transformation.

We wrap each game with an ``Env`` class. The responsibility of ``Env``
is to help you generate trajectories of the games. For developing
Reinforcement Learning (RL) algorithms, we recommend to use the
following interfaces:

- ``set_agents``: This function tells the ``Env`` what agents will be
used to perform actions in the game. Different games may have a
different number of agents. The input of the function is a list of
``Agent`` class. For example,
``env.set_agent([RandomAgent(), RandomAgent()])`` indicates that two
random agents will be used to generate the trajectories.
- ``run``: After setting the agents, this interface will run a complete
trajectory of the game, calculate the reward for each transition, and
reorganize the data so that it can be directly fed into a RL
algorithm.

For advanced access to the environment, such as traversal of the game
tree, we provide the following interfaces:

- ``step``: Given the current state, the environment takes one step
forward, and returns the next state and the next player.
- ``step_back``: Takes one step backward. The environment will restore
to the last state. The ``step_back`` is turned off by default since it
requires expensively recording previous states. To turn it on, set
``allow_step_back = True`` when ``make`` environments.
- ``get_payoffs``: At the end of the game, this function can be called
to obtain the payoffs for each player.

We also support single-agent mode and human mode. Examples can be found
in ``examples/``.

- Single agent mode: single-agent environments are developed by
simulating other players with pre-trained models or rule-based
models. You can enable single-agent mode by
``rlcard.make(ENV_ID, config={'single_agent_mode':True})``. Then the
``step`` function will return ``(next_state, reward, done)`` just as
common single-agent environments. ``env.reset()`` will reset the game
and return the first state.

Games
-----

Card games usually have similar structures. We abstract some concepts in
card games and follow the same design pattern. In this way,
users/developers can easily dig into the code and change the rules for
research purpose. Specifically, the following classes are used in all
the games:
Timeseries Processing
---------------------
Time series processing provides multiple time-series-specific preprocessing techniques, including seasonality/trend decomposition and time series transformation, scaling, and smoothing.
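One of the smoothing techniques named above can be sketched in a few lines; this is a plain moving average for illustration only, not the TODS primitive interface:

```python
# A minimal moving-average smoother, one of the time series processing
# techniques mentioned above (illustrative sketch, stdlib only).
def moving_average(values, window=3):
    """Smooth a series by averaging each value with its neighbours."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    out = []
    for i in range(len(values) - window + 1):
        out.append(sum(values[i:i + window]) / window)
    return out
```

Note the smoothed series is shorter than the input by `window - 1` points, a common trade-off for boundary handling.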

- ``Game``: A game is defined as a complete sequence starting from one
of the non-terminal states to a terminal state.
- ``Round``: A round is a part of the sequence of a game. Most card
games can be naturally divided into multiple rounds.
- ``Dealer``: A dealer is responsible for shuffling and allocating a
deck of cards.
- ``Judger``: A judger is responsible for making major decisions at the
end of a round or a game.
- ``Player``: A player is a role who plays cards following a strategy.
Feature Analysis
----------------
The feature analysis module provides exhaustive feature extraction techniques from three aspects: the time domain, the frequency domain, and latent factor models.
There are 30 feature extraction methods, including statistical methods, time series filters, spectral transformations, and matrix factorization models.

To summarize, in one ``Game``, a ``Dealer`` deals the cards for each
``Player``. In each ``Round`` of the game, a ``Judger`` will make major
decisions about the next round and the payoffs in the end of the game.
Detection Algorithms
---------------------
Based on the three scenarios above, we provide multiple algorithms, including traditional approaches (e.g., IForest, autoregression), heuristic methods (e.g., the HotSax algorithm, Matrix Profile), deep learning methods (e.g., RNN-LSTM, GAN, VAE), and ensemble methods, to address each kind of outlier.
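As a stdlib stand-in for the listed detectors (the TODS primitives expose richer interfaces than this), a median-absolute-deviation scorer sketches the point-wise case:

```python
import statistics

# A tiny robust detector (median absolute deviation) standing in for the
# algorithms listed above, such as IForest; illustrative only.
def mad_outliers(values, cutoff=3.5):
    """Flag points whose modified z-score exceeds the cutoff."""
    med = statistics.median(values)
    mad = statistics.median(abs(v - med) for v in values)
    if mad == 0:
        # Constant (or near-constant) series: nothing stands out.
        return [0] * len(values)
    # 0.6745 scales the MAD to be comparable with a standard deviation.
    return [1 if 0.6745 * abs(v - med) / mad > cutoff else 0 for v in values]
```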

Agents
------
Reinforcement Module
----------------------
The reinforcement module is designed to improve existing models with human expertise. Specifically, rule-based filtering has been developed to allow users to transform domain knowledge into rule filters; active-learning-based methods will be introduced in the near future as well.

We provide examples of several representative algorithms and wrap them
as ``Agent`` to show how a learning algorithm can be connected to the
toolkit. The first example is DQN which is a representative of the
Reinforcement Learning (RL) algorithms category. The second example is
NFSP which is a representative of the Reinforcement Learning (RL) with
self-play. We also provide CFR and DeepCFR which belong to the Counterfactual
Regret Minimization (CFR) category. Other algorithms from these three
categories can be connected in similar ways.

+ 5
- 5
examples/build_IsolationForest_pipline.py View File

@@ -100,8 +100,8 @@ with open('example_pipeline.json', 'w') as f:
    f.write(data)
print(data)

# Output to YAML
yaml = pipeline_description.to_yaml()
with open('pipeline.yml', 'w') as f:
    f.write(yaml)
print(yaml)
## Output to YAML
#yaml = pipeline_description.to_yaml()
#with open('pipeline.yml', 'w') as f:
# f.write(yaml)
#print(yaml)
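The JSON export that the commit keeps (while commenting out the YAML path) can be mimicked without the d3m dependency. The pipeline dict below is a made-up placeholder; in the real script, `data` comes from the pipeline description object:

```python
import json

# Placeholder pipeline description; the real example script obtains this
# from a d3m pipeline description object rather than a literal dict.
pipeline = {"id": "example", "steps": ["isolation_forest"]}

# Output to JSON, mirroring the example script's surviving branch.
with open('example_pipeline.json', 'w') as f:
    data = json.dumps(pipeline, indent=2)
    f.write(data)
```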

+ 0
- 6
examples/build_LODA_pipline.py View File

@@ -70,9 +70,3 @@ with open('example_pipeline.json', 'w') as f:
    f.write(data)
print(data)

# Output to YAML
yaml = pipeline_description.to_yaml()
with open('pipeline.yml', 'w') as f:
    f.write(yaml)
print(yaml)


+ 1
- 0
examples/evaluate_default_pipeline.py View File

@@ -5,6 +5,7 @@ from searcher.utils import generate_dataset_problem, evaluate_pipeline

table_path = 'datasets/yahoo_sub_5.csv'
target_index = 6 # what column is the target

#table_path = 'datasets/NAB/realTweets/labeled_Twitter_volume_IBM.csv' # The path of the dataset
time_limit = 30 # How many seconds you wanna search
#metric = 'F1' # F1 on label 1

