[ENH] add docstring for avg_confidence

1 year ago · 5b66680508
--- a/ablkit/bridge/base_bridge.py
+++ b/ablkit/bridge/base_bridge.py
@@ -36,9 +36,7 @@ class BaseBridge(metaclass=ABCMeta):

    def __init__(self, model: ABLModel, reasoner: Reasoner) -> None:
        if not isinstance(model, ABLModel):
            raise TypeError(
                f"Expected an instance of ABLModel, but received type: {type(model)}"
            )
            raise TypeError(f"Expected an instance of ABLModel, but received type: {type(model)}")
        if not isinstance(reasoner, Reasoner):
            raise TypeError(
                f"Expected an instance of Reasoner, but received type: {type(reasoner)}"
--- a/ablkit/reasoning/reasoner.py
+++ b/ablkit/reasoning/reasoner.py
@@ -30,12 +30,14 @@ class Reasoner:
        measure, wherein the candidate with lowest cost is selected as the final
        abduced label. It can be either a string representing a predefined distance
        function or a callable function. The available predefined distance functions:
        'hamming' | 'confidence'. 'hamming': directly calculates the Hamming
        distance between the predicted pseudo-label in the data example and each
        candidate, 'confidence': calculates the distance between the prediction
        and each candidate based on confidence derived from the predicted probability
        in the data example. The callable function should have the signature
        dist_func(data_example, candidates, candidate_idxs, reasoning_results) and must
        'hamming' | 'confidence' | 'avg_confidence'. 'hamming' directly calculates the
        Hamming distance between the predicted pseudo-label in the data example and each
        candidate. 'confidence' and 'avg_confidence' calculates the confidence distance
        between the predicted probabilities in the data example and each candidate, where
        the confidence distance is defined as 1 - the product of prediction probabilities
        in 'confidence' and 1 - the average of prediction probabilities in 'avg_confidence'.
        Alternatively, the callable function should have the signature
        ``dist_func(data_example, candidates, candidate_idxs, reasoning_results)`` and must
        return a cost list. Each element in this cost list should be a numerical value
        representing the cost for each candidate, and the list should have the same length
        as candidates. Defaults to 'confidence'.
--- a/docs/Intro/Reasoning.rst
+++ b/docs/Intro/Reasoning.rst
@@ -316,12 +316,14 @@ specify:
   accelerate consistency minimization. Defaults to False.
 -  ``dist_func`` (str, optional), specifying the distance function to be
   used when determining consistency between your prediction and
   candidate returned from knowledge base. Valid options include
   “confidence” (default) and “hamming”. For “confidence”, it calculates
   the distance between the prediction and candidate based on confidence
   derived from the predicted probability in the data example. For
   “hamming”, it directly calculates the Hamming distance between the
   predicted pseudo-label in the data example and candidate.
   candidate returned from knowledge base. This can be either a user-defined function
   or one that is predefined. Valid predefined options include
   “hamming”, “confidence” and “avg_confidence”. For “hamming”, it directly calculates the Hamming distance between the
   predicted pseudo-label in the data example and candidate. For “confidence”, it
   calculates the confidence distance between the predicted probabilities in the data
   example and each candidate, where the confidence distance is defined as 1 - the product
   of prediction probabilities in “confidence” and 1 - the average of prediction probabilities in “avg_confidence”.
   Defaults to “confidence”.
 - ``idx_to_label`` (dict, optional), a mapping from index in the base model to label. 
   If not provided, a default order-based index to label mapping is created. 
   Defaults to None.
@@ -357,7 +359,7 @@ As an example, consider these data examples for MNIST Addition:

 The compatible candidates after abductive reasoning for both examples
 would be ``[[1,7], [7,1]]``. However, when the reasoner calls ``abduce`` 
 to select only one candidate based on the ``confidence`` distance function, 
 to select only one candidate based on the “confidence” distance function, 
 the output would differ for each example:

 .. code:: python
@@ -373,7 +375,7 @@ Out:

      The outputs for example1 and example2 are [1,7] and [7,1], respectively.

 Specifically, as mentioned before, ``confidence`` calculates the distance between the data 
 Specifically, as mentioned before, “confidence” calculates the distance between the data 
 example and candidates based on the confidence derived from the predicted probability. 
 Take ``example1`` as an example, the ``pred_prob`` in it indicates a higher 
 confidence that the first label should be "1" rather than "7". Therefore, among the