If you are interested answer the questions, send your CV, and a current transcript of records. Feel free to reach out beforehand if you have any questions.
Bachelor's or Master's Thesis with the goal to design and develop an interactive labelling system for segmentation of advertisements from scanned newspaper archives. WHO CAN APPLY? Only enrolled students from KIT (Karlsruher Institut für Technologie) with course of studies Wirtschaftsinformatik, Wirtschaftsingenieurwesen, Informationswirtschaft, or Technische Volkswirtschaftslehre.
As the digitization of the worlds libraries and print archives continues steadily, the demand for automated processing of such documents grows. Hereby, resarchers and practicioners would like to digitally process such documents with tools from computer vision (CV) and optical character recognition (OCR). Further they would like to search and filter for certain document meta-data. However, all of this presumes the availablity of such extracted features and meta-data. As state-of-the-art machine learning (ML) classifiers still do not reach desired accuracy levels, especially on old documents or those from fringe contexts, manual labeling effort is required.
For the scope of this thesis, we limit the context to segmenting advertisements from scanned pages of newspapers and magazines. This poses an interesting use-case for, for instance, advertising researchers. Associated colleagues at the University of Mannheim (UniMA) have already manually created a labeled set of 9000 segmented pages of the US magazine "The Economist", ranging from the 1840s to today. We expect a thesis student to develop an interactive labeling system in order to support the extension of this segmentation traing data-set to many more pages. Interactive labeling hereby strives to combine automatic steps (e.g. the trained model) with incremental user input. The work-packages entail:
We expect the student to be familiar with web development. The system should be devloped with a modern web application frontend framework (e.g. Vue with Vuetify) or be forked from an existing open source segmentation system. Further we expect the model to be trained based on standard Python frameworks. Experience in this regard is required as well.
Unser Jobangebot Interactive Labeling of Scan Segmentations klingt vielversprechend? Dann freuen wir uns auf eine Bewerbung über Campusjäger by Workwise.
Bei unserem Partner Campusjäger kann man sich in nur wenigen Minuten ohne Anschreiben für diesen Job bewerben und den Status der Bewerbung live verfolgen.
Bitte sage uns wo du ähnliche Stellenanzeigen suchst und vergiss nicht deine E-Mail Adresse anzugeben!