Drupal modules for browsing and managing Fedora-based digital repositories.

Islandora Text Extraction

Introduction

Provides an action that extracts text through a Hypercube server (which uses Tesseract and pdftotext), as well as a Media type to hold the extracted text.

Requirements

  • islandora and islandora_core_feature
  • A Hypercube microservice
  • A message broker (e.g. ActiveMQ) for Islandora 8
  • An instance of islandora-connector-derivative (from Alpaca) configured for Hypercube

Installation

For a full digital repository solution (including a Hypercube microservice), see our installation documentation.

To download and enable just this module, run the following from the command line:

$ composer require islandora/islandora
$ drush en islandora_core_feature
$ drush mim islandora_tags
$ drush en islandora_text_extraction
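
The four commands depend on one another in order, so it can be convenient to wrap them in a small script that stops at the first failure. The following is a minimal sketch, not part of the module: the `run` helper, the `install_text_extraction` function, and the `DRY_RUN` variable are hypothetical names introduced for this example, and the `-y` flag (Drush's global "answer yes" option) is an addition to the commands above. By default the script only prints what it would do.

```shell
#!/bin/sh
# Sketch: run the install steps in order, stopping at the first failure.
# DRY_RUN is a hypothetical variable for this example. It defaults to 1,
# which prints each command instead of executing it.
set -eu

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Same steps as listed above, with -y added so Drush does not prompt.
install_text_extraction() {
  run composer require islandora/islandora
  run drush en -y islandora_core_feature
  run drush mim islandora_tags
  run drush en -y islandora_text_extraction
}

install_text_extraction
```

Assuming the script is saved as `install.sh` (a hypothetical filename) in the Drupal project root, running `DRY_RUN=0 sh install.sh` would execute the steps for real once the printed commands look right.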

Documentation

Official documentation is available on the Islandora 8 documentation site.

Sponsors

Original work for this module was done by @ajstanley for @roblib at University of Prince Edward Island.

Development

If you would like to contribute, please get involved by attending our weekly Tech Call. We would love to hear from you!

If you would like to contribute code to the project, you need to be covered by an Islandora Foundation Contributor License Agreement or Corporate Contributor License Agreement. Please see the Contributors pages on Islandora.ca for more information.

We recommend using the islandora-playbook to get started.

License

GPLv2