Enhancing Active Learning with Weak Supervision and Transfer Learning by Leveraging Information and Knowledge Sources
L. Rauch, D. Huseljic, and B. Sick.
Workshop on Interactive Adaptive Learning (IAL), ECML PKDD, page 27--42. (2022)

One of the major limitations of deploying a machine learning model is the availability of labeled training data and the resulting expensive annotation process. Although active learning (AL) methods may reduce the annotation cost by actively selecting the most-useful instances, a costly human annotator usually provides the labels. Therefore, even with AL, we still consider the annotation process to be time-consuming and expensive. Besides human annotators, though, companies often have a vast amount of information and knowledge sources available that can generate low-cost labels (e.g., a black-box model) or improve the learning process (e.g., a pre-trained model). We present a novel approach that enhances AL with weak supervision (WS) and transfer learning (TL) to reduce the annotation cost by leveraging these sources. Specifically, we consider a black-box model like a rule-based system as an error-prone and weakly-supervised annotator that inexpensively provides labels. We estimate its performance with an annotator model to decide whether a human annotation is required. Additionally, we utilize unlabeled internal and external data by transferring knowledge from a pre-trained model to the AL cycle. We sequentially investigate the impact of WS and TL on annotation cost and model performance in an AL cycle through a use case. Our evaluation shows that our approach can reduce annotation cost by 51% while achieving nearly identical model performance compared to a traditional AL approach.

Document

http://ceur-ws.org/Vol-3259/ialatecml_paper3.pdf

search on

This publication has not been reviewed yet.

rating distribution

average user rating0.0 out of 5.0 based on 0 reviews

Please log in to take part in the discussion (add own reviews or comments).

@inproceedings{rauch2022enhancing,
  abstract = {One of the major limitations of deploying a machine learning
model is the availability of labeled training data and the resulting 
expensive annotation process. Although active learning (AL) methods may 
reduce the annotation cost by actively selecting the most-useful instances, 
a costly human annotator usually provides the labels. Therefore, even
with AL, we still consider the annotation process to be time-consuming
and expensive. Besides human annotators, though, companies often have
a vast amount of information and knowledge sources available that can
generate low-cost labels (e.g., a black-box model) or improve the learning process 
(e.g., a pre-trained model). We present a novel approach that
enhances AL with weak supervision (WS) and transfer learning (TL) to
reduce the annotation cost by leveraging these sources. Specifically, we
consider a black-box model like a rule-based system as an error-prone
and weakly-supervised annotator that inexpensively provides labels. We
estimate its performance with an annotator model to decide whether a
human annotation is required. Additionally, we utilize unlabeled internal
and external data by transferring knowledge from a pre-trained model
to the AL cycle. We sequentially investigate the impact of WS and TL
on annotation cost and model performance in an AL cycle through a use
case. Our evaluation shows that our approach can reduce annotation cost
by 51% while achieving nearly identical model performance compared to
a traditional AL approach.},
  added-at = {2022-11-03T09:45:46.000+0100},
  author = {Rauch, Lukas and Huseljic, Denis and Sick, Bernhard},
  biburl = {https://puma.uni-kassel.de/bibtex/2d3548273919b496cea4ee625e82e20b9/04068750},
  booktitle = {Workshop on Interactive Adaptive Learning (IAL), ECML PKDD},
  interhash = {19dc16205a290b09bdc7040f1329b777},
  intrahash = {d3548273919b496cea4ee625e82e20b9},
  keywords = {Active-Learning Weak-Supervision Transfer-Learning Information-and-Knowledge-Sources imported itegpub isac-www},
  pages = {27--42},
  timestamp = {2022-11-03T09:45:46.000+0100},
  title = {Enhancing Active Learning with Weak Supervision and Transfer Learning by Leveraging Information and Knowledge Sources},
  url = {http://ceur-ws.org/Vol-3259/ialatecml_paper3.pdf},
  year = 2022
}

%0 Conference Paper
%1 rauch2022enhancing
%A Rauch, Lukas
%A Huseljic, Denis
%A Sick, Bernhard
%B Workshop on Interactive Adaptive Learning (IAL), ECML PKDD
%D 2022
%K Active-Learning Weak-Supervision Transfer-Learning Information-and-Knowledge-Sources imported itegpub isac-www
%P 27--42
%T Enhancing Active Learning with Weak Supervision and Transfer Learning by Leveraging Information and Knowledge Sources
%U http://ceur-ws.org/Vol-3259/ialatecml_paper3.pdf
%X One of the major limitations of deploying a machine learning
model is the availability of labeled training data and the resulting 
expensive annotation process. Although active learning (AL) methods may 
reduce the annotation cost by actively selecting the most-useful instances, 
a costly human annotator usually provides the labels. Therefore, even
with AL, we still consider the annotation process to be time-consuming
and expensive. Besides human annotators, though, companies often have
a vast amount of information and knowledge sources available that can
generate low-cost labels (e.g., a black-box model) or improve the learning process 
(e.g., a pre-trained model). We present a novel approach that
enhances AL with weak supervision (WS) and transfer learning (TL) to
reduce the annotation cost by leveraging these sources. Specifically, we
consider a black-box model like a rule-based system as an error-prone
and weakly-supervised annotator that inexpensively provides labels. We
estimate its performance with an annotator model to decide whether a
human annotation is required. Additionally, we utilize unlabeled internal
and external data by transferring knowledge from a pre-trained model
to the AL cycle. We sequentially investigate the impact of WS and TL
on annotation cost and model performance in an AL cycle through a use
case. Our evaluation shows that our approach can reduce annotation cost
by 51% while achieving nearly identical model performance compared to
a traditional AL approach.

PUMA

Enhancing Active Learning with Weak Supervision and Transfer Learning by Leveraging Information and Knowledge Sources
L. Rauch, D. Huseljic, and B. Sick.
Workshop on Interactive Adaptive Learning (IAL), ECML PKDD, page 27--42. (2022)

Tags

Users

Comments and Reviews

Cite this publication

PUMA

Enhancing Active Learning with Weak Supervision and Transfer Learning by Leveraging Information and Knowledge SourcesL. Rauch, D. Huseljic, and B. Sick. Workshop on Interactive Adaptive Learning (IAL), ECML PKDD, page 27--42. (2022)

Tags

Users

Comments and Reviews

Cite this publication

Enhancing Active Learning with Weak Supervision and Transfer Learning by Leveraging Information and Knowledge Sources
L. Rauch, D. Huseljic, and B. Sick.
Workshop on Interactive Adaptive Learning (IAL), ECML PKDD, page 27--42. (2022)