Browse Source

Update docs/start/intro.rst

tags/v0.3.2
昊一 雷 3 years ago
parent
commit
5159f3ea25
1 changed files with 67 additions and 28 deletions
  1. +67
    -28
      docs/start/intro.rst

+ 67
- 28
docs/start/intro.rst View File

@@ -1,47 +1,86 @@
================
Background
Introduction
================

Learnware is an AI-oriented paradigm aimed at empowering users to create versatile and efficient machine learning models. By reusing small models for tasks beyond their original purposes, Learnware enables users to experiment with and develop better strategies without building models from scratch efficient and safe.

=================
Motivation
=================

Machine learning, especially the prevailing big model paradigm, has achieved great success in natural language processing and computer vision applications. However, it still faces challenges such as the requirement of a large amount of labeled training data, difficulty in adapting to changing environments, and catastrophic forgetting when refining trained models incrementally. These big models, while useful in their targeted tasks, often fail to address the above issues and struggle to generalize beyond their specific purposes.

To better address the entangled issues in machine learning, we should consider the following aspects:

1. Investigate techniques that address multiple challenges simultaneously, recognizing that these issues are often intertwined in real-world applications.
2. Explore paradigms like learnware, which offers the possibility of systematically reusing small models for tasks beyond their original purposes, reducing the need for users to build models from scratch.
3. Develop solutions that enable ordinary users to create well-performing models without requiring proficient training skills.
4. Address data privacy and proprietary concerns to facilitate experience sharing among different users while respecting confidentiality.
5. Adapt to the constraints of big data applications, where it may be unaffordable or infeasible to hold all data for multiple passes of scanning.
6. Consider the environmental impact of training large models, as their carbon emissions pose a threat to our environment.
+------------------------------------------------------------------------------------+
| Aspect |
+====================================================================================+
| 1. Investigate techniques that address multiple challenges simultaneously, |
| recognizing that these issues are often intertwined in real-world applications.|
+------------------------------------------------------------------------------------+
| 2. Explore paradigms like learnware, which offers the possibility of |
| systematically reusing small models for tasks beyond their original purposes, |
| reducing the need for users to build models from scratch. |
+------------------------------------------------------------------------------------+
| 3. Develop solutions that enable ordinary users to create well-performing models |
| without requiring proficient training skills. |
+------------------------------------------------------------------------------------+
| 4. Address data privacy and proprietary concerns to facilitate experience |
| sharing among different users while respecting confidentiality. |
+------------------------------------------------------------------------------------+
| 5. Adapt to the constraints of big data applications, where it may be |
| unaffordable or infeasible to hold all data for multiple passes of scanning. |
+------------------------------------------------------------------------------------+
| 6. Consider the environmental impact of training large models, as their carbon |
| emissions pose a threat to our environment. |
+------------------------------------------------------------------------------------+

By considering these factors, we can develop a more comprehensive framework for tackling the complex challenges in machine learning, moving beyond the limitations of the big model paradigm, called Learnware.

.. contents:: Table of Contents

================
Introduction
================
=======================
Framework
=======================

Learnware is an AI-oriented paradigm aimed at empowering users to create versatile and efficient machine learning models. By reusing small models for tasks beyond their original purposes, Learnware enables users to experiment with and develop better strategies without building models from scratch.

The learnware paradigm introduces the concept of a well-performed, trained machine learning model with a specification that allows future users, who have no prior knowledge of the learnware, to reuse it based on their requirements.

================
Framework
================
The Learnware framework consists of the following components, designed as loosely-coupled modules that can be used independently:
Developers or owners of trained machine learning models can submit their models to a learnware market. If accepted, the market assigns a specification to the model and accommodates it. The learnware market could host thousands or millions of well-performed models from different developers, for various tasks, using diverse data, and optimizing different objectives.

Instead of building a model from scratch, users can submit their requirements to the learnware market, which then identifies and deploys helpful learnware(s) based on the specifications. Users can apply the learnware directly, adapt it using their data, or exploit it in other ways to improve their model. This process is more efficient and less expensive than building a model from scratch.

Name Description
-------------------------------- --------------------------------------------------------------------------------
Infrastructure layer Provides underlying support for machine learning research, including high-performance data management and retrieval and flexible model training control.
Learning Framework layer Contains learnable Forecast Models and Trading Agents that leverage reinforcement learning and supervised learning paradigms, integrated with the Workflow layer for information extraction and environment creation.
Workflow layer Covers the entire machine learning workflow, supporting both supervised-learning-based strategies and RL-based strategies, including Information Extractor, Forecast Model, Decision Generator, and Execution Environment.
Interface layer Presents a user-friendly interface for the underlying system, providing detailed analysis reports of forecasting signals, models, and execution results.
Benefits of the Learnware Paradigm
----------------------------------

The Learnware framework may be intimidating for new users, but it accurately captures the details of the design. Users new to Learnware can initially skip it and revisit it later.
+-----------------------+-----------------------------------------------------------------------------------------------+
| Benefit | Description |
+=======================+===============================================================================================+
| When lack of training data | Strong models can be built with small data by adapting well-performed learnwares. |
+-----------------------+-----------------------------------------------------------------------------------------------+
| When lack of training | Ordinary users can obtain strong models by leveraging well-performed learnwares instead of |
| skills | building models from scratch. |
+-----------------------+-----------------------------------------------------------------------------------------------+
| Catastrophic | Accepted learnwares are always stored in the learnware market, retaining old knowledge. |
| forgetting | |
+-----------------------+-----------------------------------------------------------------------------------------------+
| Continual learning | The learnware market continually enriches its knowledge with constant submissions of |
| | well-performed learnwares. |
+-----------------------+-----------------------------------------------------------------------------------------------+
| Data privacy/ | Developers only submit models, not data, preserving data privacy/proprietary. |
| proprietary | |
+-----------------------+-----------------------------------------------------------------------------------------------+
| Unplanned tasks | Open to all legal developers, the learnware market can accommodate helpful learnwares for |
| | various tasks. |
+-----------------------+-----------------------------------------------------------------------------------------------+
| Carbon emission | Assembling small models may offer good-enough performance, reducing interest in training |
| | large models and the carbon footprint. |
+-----------------------+-----------------------------------------------------------------------------------------------+

.. note::
The modules with hand-drawn style are under development and will be released in the future.
The modules with dashed borders are highly user-customizable and extendible.
Challenges and Future Work
--------------------------

.. tip::
The framework image is created with https://draw.io/
Although the learnware proposal shows promise, much work remains to make it a reality. The next sections will present some of the progress made so far.
.. contents:: Table of Contents




Loading…
Cancel
Save