You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.

README.md 9.7 kB

1 year ago
1 year ago
1 year ago
1 year ago
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238
  1. <div align="center">
  2. <p align="center">
  3. <img src="./docs/_static/img/logo.png" alt="ABLkit logo" style="width: 35%;"/>
  4. </p>
  5. <br>
  6. [![PyPI - Python Version](https://img.shields.io/pypi/pyversions/ablkit)](https://pypi.org/project/ablkit/)
  7. [![PyPI version](https://badgen.net/pypi/v/ablkit)](https://pypi.org/project/ablkit/)
  8. [![Documentation Status](https://readthedocs.org/projects/ablkit/badge/?version=latest)](https://ablkit.readthedocs.io/en/latest/?badge=latest)
  9. [![license](https://img.shields.io/github/license/mashape/apistatus.svg?maxAge=2592000)](https://github.com/AbductiveLearning/ABLkit/blob/main/LICENSE)
  10. [![flake8 Lint](https://github.com/AbductiveLearning/ABLkit/actions/workflows/lint.yaml/badge.svg)](https://github.com/AbductiveLearning/ABLkit/actions/workflows/lint.yaml)
  11. [![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
  12. [![ABLkit-CI](https://github.com/AbductiveLearning/ABLkit/actions/workflows/build-and-test.yaml/badge.svg)](https://github.com/AbductiveLearning/ABLkit/actions/workflows/build-and-test.yaml)
  13. [📘Documentation](https://ablkit.readthedocs.io/en/latest/index.html) |
  14. [📚Examples](https://github.com/AbductiveLearning/ABLkit/tree/main/examples) |
  15. [💬Reporting Issues](https://github.com/AbductiveLearning/ABLkit/issues/new)
  16. </div>
  17. # ABLkit: A Python Toolkit for Abductive Learning
  18. **ABLkit** is an efficient Python toolkit for **Abductive Learning (ABL)**.
  19. ABL is a novel paradigm that integrates machine learning and
  20. logical reasoning in a unified framework. It is suitable for tasks
  21. where both data and (logical) domain knowledge are available.
  22. <p align="center">
  23. <img src="./docs/_static/img/ABL.png" alt="Abductive Learning" style="width: 80%;"/>
  24. </p>
  25. Key Features of ABLkit:
  26. - **Great Flexibility**: Adaptable to various machine learning modules and logical reasoning components.
  27. - **User-Friendly**: Provide data, model, and KB, and get started with just a few lines of code.
  28. - **High-Performance**: Optimization for high accuracy and fast training speed.
  29. ABLkit encapsulates advanced ABL techniques, providing users with
  30. an efficient and convenient toolkit to develop dual-driven ABL systems,
  31. which leverage the power of both data and knowledge.
  32. <p align="center">
  33. <img src="./docs/_static/img/ABLkit.png" alt="ABLkit" style="width: 80%;"/>
  34. </p>
  35. ## Installation
  36. ### Install from PyPI
  37. The easiest way to install ABLkit is using ``pip``:
  38. ```bash
  39. pip install ablkit
  40. ```
  41. ### Install from Source
  42. Alternatively, to install from source code, sequentially run following commands in your terminal/command line.
  43. ```bash
  44. git clone https://github.com/AbductiveLearning/ABLkit.git
  45. cd ABLkit
  46. pip install -v -e .
  47. ```
  48. ### (Optional) Install SWI-Prolog
  49. If the use of a [Prolog-based knowledge base](https://ablkit.readthedocs.io/en/latest/Intro/Reasoning.html#prolog) is necessary, please also install [SWI-Prolog](https://www.swi-prolog.org/):
  50. For Linux users:
  51. ```bash
  52. sudo apt-get install swi-prolog
  53. ```
  54. For Windows and Mac users, please refer to the [SWI-Prolog Install Guide](https://github.com/yuce/pyswip/blob/master/INSTALL.md).
  55. ## Quick Start
  56. We use the MNIST Addition task as a quick start example. In this task, pairs of MNIST handwritten images and their sums are given, alongwith a domain knowledge base which contains information on how to perform addition operations. Our objective is to input a pair of handwritten images and accurately determine their sum.
  57. <details>
  58. <summary>Working with Data</summary>
  59. <br>
  60. ABLkit requires data in the format of `(X, gt_pseudo_label, Y)` where `X` is a list of input examples containing instances, `gt_pseudo_label` is the ground-truth label of each example in `X` and `Y` is the ground-truth reasoning result of each example in `X`. Note that `gt_pseudo_label` is only used to evaluate the machine learning model's performance but not to train it.
  61. In the MNIST Addition task, the data loading looks like:
  62. ```python
  63. # The 'datasets' module below is located in 'examples/mnist_add/'
  64. from datasets import get_dataset
  65. # train_data and test_data are tuples in the format of (X, gt_pseudo_label, Y)
  66. train_data = get_dataset(train=True)
  67. test_data = get_dataset(train=False)
  68. ```
  69. </details>
  70. <details>
  71. <summary>Building the Learning Part</summary>
  72. <br>
  73. Learning part is constructed by first defining a base model for machine learning. ABLkit offers considerable flexibility, supporting any base model that conforms to the scikit-learn style (which requires the implementation of fit and predict methods), or a PyTorch-based neural network (which has defined the architecture and implemented forward method). In this example, we build a simple LeNet5 network as the base model.
  74. ```python
  75. # The 'models' module below is located in 'examples/mnist_add/'
  76. from models.nn import LeNet5
  77. cls = LeNet5(num_classes=10)
  78. ```
  79. To facilitate uniform processing, ABLkit provides the `BasicNN` class to convert a PyTorch-based neural network into a format compatible with scikit-learn models. To construct a `BasicNN` instance, aside from the network itself, we also need to define a loss function, an optimizer, and the computing device.
  80. ```python
  81. ​import torch
  82. ​from ablkit.learning import BasicNN
  83. ​loss_fn = torch.nn.CrossEntropyLoss()
  84. ​optimizer = torch.optim.RMSprop(cls.parameters(), lr=0.001, alpha=0.9)
  85. ​device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
  86. ​base_model = BasicNN(model=cls, loss_fn=loss_fn, optimizer=optimizer, device=device)
  87. ```
  88. The base model built above is trained to make predictions on instance-level data (e.g., a single image), while ABL deals with example-level data. To bridge this gap, we wrap the base_model into an instance of `ABLModel`. This class serves as a unified wrapper for base models, facilitating the learning part to train, test, and predict on example-level data, (e.g., images that comprise an equation).
  89. ```python
  90. from ablkit.learning import ABLModel
  91. ​model = ABLModel(base_model)
  92. ```
  93. </details>
  94. <details>
  95. <summary>Building the Reasoning Part</summary>
  96. <br>
  97. To build the reasoning part, we first define a knowledge base by creating a subclass of `KBBase`. In the subclass, we initialize the `pseudo_label_list` parameter and override the `logic_forward` method, which specifies how to perform (deductive) reasoning that processes pseudo-labels of an example to the corresponding reasoning result. Specifically, for the MNIST Addition task, this `logic_forward` method is tailored to execute the sum operation.
  98. ```python
  99. from ablkit.reasoning import KBBase
  100. class AddKB(KBBase):
  101. def __init__(self, pseudo_label_list=list(range(10))):
  102. super().__init__(pseudo_label_list)
  103. ​ def logic_forward(self, nums):
  104. return sum(nums)
  105. kb = AddKB()
  106. ```
  107. Next, we create a reasoner by instantiating the class `Reasoner`, passing the knowledge base as a parameter. Due to the indeterminism of abductive reasoning, there could be multiple candidate pseudo-labels compatible to the knowledge base. In such scenarios, the reasoner can minimize inconsistency and return the pseudo-label with the highest consistency.
  108. ```python
  109. from ablkit.reasoning import Reasoner
  110. reasoner = Reasoner(kb)
  111. ```
  112. </details>
  113. <details>
  114. <summary>Building Evaluation Metrics</summary>
  115. <br>
  116. ABLkit provides two basic metrics, namely `SymbolAccuracy` and `ReasoningMetric`, which are used to evaluate the accuracy of the machine learning model's predictions and the accuracy of the `logic_forward` results, respectively.
  117. ```python
  118. from ablkit.data.evaluation import ReasoningMetric, SymbolAccuracy
  119. metric_list = [SymbolAccuracy(), ReasoningMetric(kb=kb)]
  120. ```
  121. </details>
  122. <details>
  123. <summary>Bridging Learning and Reasoning</summary>
  124. <br>
  125. Now, we use `SimpleBridge` to combine learning and reasoning in a
  126. unified ABL framework.
  127. ```python
  128. from ablkit.bridge import SimpleBridge
  129. bridge = SimpleBridge(model, reasoner, metric_list)
  130. ```
  131. Finally, we proceed with training and testing.
  132. ```python
  133. ​bridge.train(train_data, loops=1, segment_size=0.01)
  134. bridge.test(test_data)
  135. ```
  136. </details>
  137. To explore detailed tutorials and information, please refer to - [document](https://ablkit.readthedocs.io/en/latest/index.html).
  138. ## Examples
  139. We provide several examples in `examples/`. Each example is stored in a separate folder containing a README file.
  140. + [MNIST Addition](https://github.com/AbductiveLearning/ABLkit/tree/main/examples/mnist_add)
  141. + [Handwritten Formula (HWF)](https://github.com/AbductiveLearning/ABLkit/tree/main/examples/hwf)
  142. + [Handwritten Equation Decipherment](https://github.com/AbductiveLearning/ABLkit/tree/main/examples/hed)
  143. + [Zoo](https://github.com/AbductiveLearning/ABLkit/tree/main/examples/zoo)
  144. ## References
  145. For more information about ABL, please refer to: [Zhou, 2019](http://scis.scichina.com/en/2019/076101.pdf) and [Zhou and Huang, 2022](https://www.lamda.nju.edu.cn/publication/chap_ABL.pdf).
  146. ```latex
  147. @article{zhou2019abductive,
  148. title = {Abductive learning: towards bridging machine learning and logical reasoning},
  149. author = {Zhou, Zhi-Hua},
  150. journal = {Science China Information Sciences},
  151. volume = {62},
  152. number = {7},
  153. pages = {76101},
  154. year = {2019}
  155. }
  156. @incollection{zhou2022abductive,
  157. title = {Abductive Learning},
  158. author = {Zhou, Zhi-Hua and Huang, Yu-Xuan},
  159. booktitle = {Neuro-Symbolic Artificial Intelligence: The State of the Art},
  160. editor = {Pascal Hitzler and Md. Kamruzzaman Sarker},
  161. publisher = {{IOS} Press},
  162. pages = {353--369},
  163. address = {Amsterdam},
  164. year = {2022}
  165. }
  166. ```

An efficient Python toolkit for Abductive Learning (ABL), a novel paradigm that integrates machine learning and logical reasoning in a unified framework.