What is the INESData linguistic dataspace?

The INESData linguistic dataspace (EDL) is a secure infrastructure for accessing, using, and sharing data for language technologies in Spanish and the co-official languages. The EDL follows the guidelines of the European data strategy and guarantees data privacy and sovereignty: individuals and companies keep full control over their data.


Linguistic Resources

The linguistic resources that can be published in the EDL include:

  • Corpora
  • Lexical resources
  • Annotated datasets
  • Pre-trained language models

    Providers

    Organizations and individuals can make linguistic resources available to EDL users. Resources can be hosted either on the provider's own infrastructure (for example, at an HTTP address) or in the INESData cloud storage system.

    Providers can assign two kinds of policies to a resource: access policies, which limit who can see it, and contracting policies, which are validated when a consumer starts contracting it.
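
The difference between the two kinds of policy can be illustrated with a minimal sketch. It is not the EDL policy model: the class names, the attribute sets, and the rule "all required attributes must be present" are hypothetical and chosen only to show that visibility and contracting are checked separately.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Consumer:
    """A dataspace participant described by hypothetical attributes."""
    name: str
    attributes: frozenset = frozenset()


@dataclass(frozen=True)
class Resource:
    """A published resource with two independent sets of required attributes."""
    title: str
    access_requires: frozenset = frozenset()    # who may see the resource
    contract_requires: frozenset = frozenset()  # who may contract it

    def is_visible_to(self, consumer: Consumer) -> bool:
        # Access policy: every required attribute must be present.
        return self.access_requires <= consumer.attributes

    def can_be_contracted_by(self, consumer: Consumer) -> bool:
        # Contracting policy: checked when contracting starts, and only
        # for resources the consumer is already allowed to see.
        return (self.is_visible_to(consumer)
                and self.contract_requires <= consumer.attributes)


# Hypothetical example: visible to researchers, contractable with a licence.
corpus = Resource(
    "Example corpus",
    access_requires=frozenset({"research"}),
    contract_requires=frozenset({"signed-licence"}),
)
alice = Consumer("alice", frozenset({"research"}))

print(corpus.is_visible_to(alice))         # True
print(corpus.can_be_contracted_by(alice))  # False
```

In this sketch a consumer can see a resource and still fail the contracting check, which is the case the two separate policies are meant to cover.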


    Consumers

    Organizations and individuals interested in linguistic resources can negotiate access to them with the provider.

    Once the negotiation is completed, resources are transferred either directly to the consumer over HTTP or to the INESData cloud storage system.


    INESData Cloud Storage

    INESData provides EDL users with the MinIO cloud storage system.

    MinIO is compatible with the Amazon S3 API, which facilitates its integration with other S3-compatible services.
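
Because of this compatibility, a general-purpose S3 client can in principle be pointed at a MinIO endpoint. The following is a minimal sketch using the third-party boto3 library. The endpoint URL, bucket name, object key, and credentials are placeholders, and whether and how INESData exposes its storage endpoint to users is an assumption, not something stated here.

```python
def minio_client_kwargs(endpoint_url: str, access_key: str, secret_key: str) -> dict:
    """Build keyword arguments for boto3.client("s3", ...) aimed at a MinIO endpoint."""
    return {
        "endpoint_url": endpoint_url,          # the MinIO server instead of AWS
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }


def upload_resource(path: str, bucket: str, key: str, **client_kwargs) -> None:
    """Upload a local file to a bucket through the standard S3 API."""
    import boto3  # third-party dependency: pip install boto3

    s3 = boto3.client("s3", **client_kwargs)
    s3.upload_file(path, bucket, key)


# Placeholder values; replace them with the ones issued for your account.
kwargs = minio_client_kwargs(
    "https://minio.example.org", "ACCESS_KEY", "SECRET_KEY"
)
# upload_resource("corpus.zip", "my-bucket", "corpora/corpus.zip", **kwargs)
```

The upload call is left commented out because it needs a reachable server and valid credentials. The only MinIO-specific part is `endpoint_url`; the rest is the same code one would write against Amazon S3.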


    EDL Catalog

    Linguistic resources published by users of the dataspace are visible in the EDL catalog to other users who meet the resources' access conditions.

    From the catalog, users can initiate the negotiation of a resource.