Drupal modules for browsing and managing Fedora-based digital repositories.

Islandora Text Extraction

Introduction

Provides an action to extract text using a Hypercube server (tesseract and pdftotext), as well as a Media type to hold the extracted text.

Requirements

  • islandora and islandora_core_feature
  • A Hypercube microservice
  • A message broker (e.g. ActiveMQ) for Islandora 8
  • An instance of islandora-connector-derivative (from Alpaca) configured for Hypercube

Installation

For a full digital repository solution (including a Hypercube microservice), see our installation documentation.

To download and enable just this module, run the following from the command line:

$ composer require islandora/islandora
$ drush en islandora_core_feature
$ drush mim islandora_tags
$ drush en islandora_text_extraction
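
The commands above can also be wrapped in a small script so that a failure stops the sequence early. This is a minimal sketch, not something shipped with the module: the `run` helper and the `DRY_RUN` variable are hypothetical names introduced here for illustration, and the script assumes `composer` and `drush` are on the PATH and that it is run from the Drupal project root.

```shell
#!/bin/sh
# Sketch only: runs the install commands from this README in order.
# DRY_RUN and run() are illustrative names, not part of Islandora.
set -eu

# Default to printing the commands instead of executing them.
DRY_RUN="${DRY_RUN:-1}"

run() {
  if [ "$DRY_RUN" = "1" ]; then
    # Dry run: show the command that would be executed.
    printf '%s\n' "$*"
  else
    # Real run: execute the command; set -e aborts on the first failure.
    "$@"
  fi
}

run composer require islandora/islandora
run drush en islandora_core_feature
run drush mim islandora_tags
run drush en islandora_text_extraction
```

By default the script only prints the commands. Setting `DRY_RUN=0` in the environment makes it execute them.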

Documentation

Official documentation is available on the Islandora 8 documentation site.

Sponsors

Original work for this module was done by @ajstanley for @roblib at University of Prince Edward Island.

Development

If you would like to contribute, please get involved by attending our weekly Tech Call. We love to hear from you!

If you would like to contribute code to the project, you need to be covered by an Islandora Foundation Contributor License Agreement or Corporate Contributor License Agreement. Please see the Contributors pages on Islandora.ca for more information.

We recommend using the islandora-playbook to get started.

License

GPLv2