Semantic Mapping Vocabulary

Metadata

IRI
http://w3id.org/semapv/vocab/semapv.owl
Title

Semantic Mapping Vocabulary

License
https://creativecommons.org/publicdomain/zero/1.0/
Version Iri
http://w3id.org/semapv/vocab/releases/2026-06-02/semapv.owl
Description

The Semantic Mapping Vocabulary provides and defines terms used for creating and maintaining semantic mappings, in particular mapping metadata.

Classes

agent-based matching process c

IRI https://w3id.org/semapv/vocab/AgentBasedMatching
Description
  • Distinguished from single-step LLM-based matching by the agent's autonomy over its own process. Distinguished from composite matching, which combines heterogeneous matching techniques under a fixed pipeline, by the agent's runtime control over what to do next. In current practice, agent-based matchers are predominantly driven by large language models, but this class does not commit to a particular agent technology — a specific process may also be classified as semapv:LLMBasedMatching (or any other matching class) where appropriate.

  • A matching process in which an autonomous agent plans and executes a multi-step workflow — typically involving tool use, retrieval, intermediate reasoning, and self-evaluation — to determine matches between entities. The agent decides at runtime what information to gather, which techniques to apply, and when to stop.

Example
An agent is given a pair of candidate entities and autonomously decides to look up their labels, query an external knowledge base, retrieve definitions, compare them with a similarity measure, and revise its judgement before producing a final match decision; the OntoAligner toolkit provides agentic matching of this kind.
Source https://doi.org/10.48550/arXiv.2503.21902
Sub Class Of matching process c

background knowledge-based matching process c

IRI https://w3id.org/semapv/vocab/BackgroundKnowledgeBasedMatching
Description

A matching process that exploits background knowledge from external resources, commonly referred to as background knowledge resources. This approach is also known as indirect matching, BK-based matching or context-based matching.

Example
A match between a subject A and an object B was established because they appear equivalent under consideration of externally provided background knowledge.
Source https://doi.org/10.1016/j.websem.2018.04.001
Sub Class Of matching process c

blank normalisation c

IRI https://w3id.org/semapv/vocab/BlankNormalisation
Description

A normalization technique replacing all blank characters, such as space, tabulation, carriage return (or sequences of these) into a single blank character.

Example
The string 'ontology  matching' (two spaces between 'ontology' and 'matching') becomes 'ontology matching' (one space).
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

blocking c

IRI https://w3id.org/semapv/vocab/Blocking
Description
  • Blocking is conceptually distinct from matching: a blocker is recall-oriented and typically encodes each entity independently of the others, whereas a matcher is precision-oriented and decides about a specific candidate pair. Many modern pipelines combine an embedding-based blocker with a transformer- or LLM-based matcher.

  • A mapping activity that, given two sets of input entities, produces a reduced set of candidate entity pairs likely to match — favouring recall over precision — to be passed on to a downstream matching process.

Example
Out of 10^7 possible subject–object pairs, a blocker returns 10^5 candidate pairs by applying inexpensive comparisons such as token overlap, hash buckets, or embedding-space nearest-neighbour search.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of mapping activity c
Super Class Of embedding-based blocking c

bounded path matching c

IRI https://w3id.org/semapv/vocab/BoundedPathMatching
Description
  • Described as a structural matching technique in Euzenat & Shvaiko, "Ontology Matching" (2nd ed., 2013), and implemented in classical structural OM systems such as Anchor-PROMPT.

  • A structural matching process based on the comparison of the matched super- and subclass paths (up to a bounded depth) of two candidate entities.

Example
Two classes are matched because their direct superclasses and subclasses (within a bounded depth) are already paired by other matchers, and the structural agreement among neighbours raises confidence in the match.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of structural matching process c

cardinality filtering c

IRI https://w3id.org/semapv/vocab/CardinalityFiltering
Sub Class Of matching process post-processing c

case normalization c

IRI https://w3id.org/semapv/vocab/CaseNormalization
Description

A normalization technique converting each alphabetic character in a string to their lower case counterpart.

Example
The string 'Ontology' is normalised to 'ontology' (lower case).
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

composite matching process c

IRI https://w3id.org/semapv/vocab/CompositeMatching
Description

A matching process based on multiple, possibly intertwined, matching approaches.

Example
An ontology matching tool determines that a subject should be mapped to an object by employing a range of techniques, including lexical, semantic and structural.
Sub Class Of matching process c

diacritics suppression c

IRI https://w3id.org/semapv/vocab/DiacriticsSuppression
Description

A normalization technique replacing diactritic signs (e.g. accents, hats) with their most frequent replacements.

Example
The string 'Protégé' is normalised to 'Protege'.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

digit suppression c

IRI https://w3id.org/semapv/vocab/DigitSuppression
Description

A normalization technique removing all numbers in a string.

Example
The string 'ontology1' becomes 'ontology'.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

embedding-based blocking c

IRI https://w3id.org/semapv/vocab/EmbeddingBasedBlocking
Description

A blocking process in which entities are encoded independently as vector embeddings and candidate pairs are produced by nearest-neighbour search or similarity-threshold filtering in the embedding space.

Example
A sentence-transformer encodes every subject and object label into a vector; an approximate nearest-neighbour index (e.g. FAISS) returns, for each object, the top-k subjects whose embeddings are closest in cosine distance, as the set of candidate pairs to be matched downstream.
Source https://doi.org/10.48550/arXiv.2310.11244
Sub Class Of blocking c

embedding-based matching process c

IRI https://w3id.org/semapv/vocab/EmbeddingBasedMatching
Description
  • Where embedding similarity is used only to generate candidate pairs for a downstream matcher (rather than to make the final match decision), the process is more accurately classified as semapv:EmbeddingBasedBlocking.

  • A machine learning-based matching process in which matches are identified by comparing learned vector representations (embeddings) of entities under a vector similarity measure.

Example
A sentence-transformer encodes the label of each candidate entity; the pair is reported as a match when the cosine similarity between the two embeddings exceeds a fixed threshold.
Source https://doi.org/10.48550/arXiv.2310.11244
Sub Class Of machine learning-based matching process c
Super Class Of

graph representation learning-based matching process c

IRI https://w3id.org/semapv/vocab/GraphRepresentationLearningBasedMatching
Description

An embedding-based matching process in which matches are identified by comparing learned vector representations that encode the structural and semantic features of entities and their surrounding graph context.

Example
A graph embedding method such as node2vec, GraphSAGE, or OWL2Vec* encodes each entity together with its neighbourhood in the ontology graph; matches are produced where the resulting embeddings are close under a similarity measure.
Source https://doi.org/10.48550/arXiv.2503.21902
Sub Class Of embedding-based matching process c

hamming distance c

IRI https://w3id.org/semapv/vocab/HammingDistance
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of string-based similarity measure c

instance-based matching process c

IRI https://w3id.org/semapv/vocab/InstanceBasedMatching
Description

A matching process based on individual representations (or instances).

Example
A match between a subject A and an object B was established because they share the same instances.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of matching process c

LLM-based matching process c

IRI https://w3id.org/semapv/vocab/LLMBasedMatching
Description
  • Where an LLM is used only to produce embeddings that are then compared by similarity, the process is an embedding-based (transformer-based) matching process rather than an LLM-based one. The classifier is how the model is used, not which model. Multi-step, tool-using, autonomous workflows driven by an LLM are captured by semapv:AgentBasedMatching.

  • A machine learning-based matching process in which a large language model determines matches between entities, typically through single-step or few-shot prompting, or through fine-tuning the model to judge mapping candidates or generate mappings directly.

Example
OLaLa prompts a large language model with the labels and surrounding context of two candidate entities and uses the model's yes/no decision as the match outcome.
Source https://doi.org/10.1145/3587259.3627571
Sub Class Of machine learning-based matching process c

language-based similarity measure c

IRI https://w3id.org/semapv/vocab/LanguageBasedSimilarityMeasure
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of similarity measure c

lemmatization c

IRI https://w3id.org/semapv/vocab/Lemmatization
Description

The process of reducing the different forms of a word to one single form.

Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

levenshtein distance c

IRI https://w3id.org/semapv/vocab/LevenshteinEditDistance
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of edit distance c

lexical matching process c

IRI https://w3id.org/semapv/vocab/LexicalMatching
Description

A matching process based on a lexical comparison between one or more syntactic features of the subject with one or more syntactic features of the object.

Example
The label of a subject entity matches to an exact synonym of an object entity.
Sub Class Of matching process c

lexical similarity threshold-based matching process c

IRI https://w3id.org/semapv/vocab/LexicalSimilarityThresholdMatching
Description

A lexical matching process based on a minimum threshold of a score from a comparison based on a lexical similarity algorithm.

Example
A match between a subject and an object was established because they had a Levenshtein score higher than 0.8.
Sub Class Of matching process c

link stripping c

IRI https://w3id.org/semapv/vocab/LinkStripping
Description

A normalization technique replacing specific links between words, such as apostrophes, dashes, underscore, etc into dashes or blanks.

Example
The string 'alzheimer's' becomes 'alzheimers'.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

logical consistency filtering c

IRI https://w3id.org/semapv/vocab/LogicalConsistencyFiltering
Sub Class Of matching process post-processing c

logical reasoning matching process c

IRI https://w3id.org/semapv/vocab/LogicalReasoning
Description

A matching process based on the inferences made by a logical reasoner.

Example
Two classes across ontologies are determined equivalent by an OWL reasoner such as ELK.
Sub Class Of matching process c

machine learning-based matching process c

IRI https://w3id.org/semapv/vocab/MachineLearningBasedMatching
Description

A matching process in which matches between entities are predicted by machine learning models trained on labelled data or learned representations.

Example
A trained machine learning model classifies a pair of candidate entities as matching or not based on learned features; the OntoAligner toolkit packages a number of such models.
Source https://doi.org/10.1145/511446.511532
Sub Class Of matching process c
Super Class Of

manual mapping curation c

IRI https://w3id.org/semapv/vocab/ManualMappingCuration
Description

A matching process that is performed by a human agent and is based on human judgement and domain knowledge.

Example
A human curator determines that a subject should be mapped to an object by virtue of their domain expertise.
Sub Class Of matching process c

mapping c

IRI https://w3id.org/semapv/vocab/Mapping
Description

A triple comprising a subject entity s, an object entity o and a mapping predicate p.

Example
The subject entity NCI:C9305 is mapped to the object entity ICD10:C80.9 using the skos:relatedMatch mapping predicate.

mapping activity c

IRI https://w3id.org/semapv/vocab/MappingActivity
Description

A process that relates to the creation, confirmation, rejection or curation of a mapping.

Example
Matching is a mapping activity that results in the creating of a mapping; mapping review is an activity that results in the confirmation of a mapping.
Super Class Of

mapping chaining-based matching process c

IRI https://w3id.org/semapv/vocab/MappingChaining
Description

A matching process based on the traversing of multiple mappings.

Example
A match between a subject A and an object B was established because A was mapped to C, C was mapped to D and D was mapped to B.
Sub Class Of matching process c

mapping inversion-based matching process c

IRI https://w3id.org/semapv/vocab/MappingInversion
Description

A matching process based on the inverting or flipping of the subject with the object of a mapping in accordance with the semantics of the mapping predicate.

Example
A broad match between a subject A and an object B using the skos:narrowMatch mapping relation was established because B was mapped to A using the skos:broadMatch mapping relation.
Source https://mapping-commons.github.io/sssom/chaining_rules/
Sub Class Of matching process c

mapping review c

IRI https://w3id.org/semapv/vocab/MappingReview
Description

A process that is concerned with determining if a mapping “candidate” (otherwise determined) is reasonable/correct.

Example
A match between a subject A and an object B was established elsewhere, and a human reviewer determined that the mapping is true (or false) based on an independent evaluation.
Sub Class Of mapping activity c

matching process post-processing c

IRI https://w3id.org/semapv/vocab/MatchingPostProcessing
Description

A process applied to a set of mappings, usually with the intention of changing it.

Sub Class Of mapping activity c
Super Class Of

matching process pre-processing c

IRI https://w3id.org/semapv/vocab/MatchingPreprocessing
Description

A preliminary processing of inputs prior to performing matching.

Sub Class Of mapping activity c
Super Class Of normalization c

punctuation elimination c

IRI https://w3id.org/semapv/vocab/PunctuationElemination
Description

A normalization technique removing all punctation characters from a string.

Example
The string 'e.g.' becomes 'eg'.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

regex removal c

IRI https://w3id.org/semapv/vocab/RegexRemoval
Description

A preprocessing method transforming a string by matching a regular expression and then removing that match.

Example
The regex match “[ -]phenotype” is removed from the label field of the subject entity in the mapping.
Sub Class Of normalization c

regex replacement c

IRI https://w3id.org/semapv/vocab/RegexReplacement
Description

A preprocessing method transforming a string by matching a regular expression and then replacing that match with a specified substitution string.

Example
The regex match “[ -]phenotype” is replaced by “-disease” for the label field of the subject entity in the mapping.
Sub Class Of normalization c

semantic similarity threshold-based matching process c

IRI https://w3id.org/semapv/vocab/SemanticSimilarityThresholdMatching
Description

A matching process based on a minimum threshold of a score from a comparison based on a semantic similarity algorithm.

Example
A match between a subject and an object entity was established because they had a Jaccard score higher than 0.8 based on the set of (common) superclasses.
Sub Class Of matching process c

similarity measure c

IRI https://w3id.org/semapv/vocab/SimilarityMeasure
Description

A technique for determining a score that characterises the similarity between two entities.

Source https://doi.org/10.1007/978-3-642-38721-0
Super Class Of

stable marriage filtering c

IRI https://w3id.org/semapv/vocab/StableMarriageFiltering
Sub Class Of matching process post-processing c

stemming c

IRI https://w3id.org/semapv/vocab/Stemming
Description

The process of reducing a word to its word stem.

Sub Class Of normalization c

stop-word removal c

IRI https://w3id.org/semapv/vocab/StopWordRemoval
Description

A preprocessing method transforming a string by removing a list of stop words.

Example
The stop-words “of, and, is, a” are removed from the label field of the subject entity in the mapping.
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

string-based similarity measure c

IRI https://w3id.org/semapv/vocab/StringBasedSimilarityMeasure
Description

A similarity measure based on the comparison of strings.

Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of similarity measure c
Super Class Of

string equality c

IRI https://w3id.org/semapv/vocab/StringEquality
Description

A string-based similarity measure which involves determining if two strings associated with mapping entities are equal.

Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of string-based similarity measure c

structural matching process c

IRI https://w3id.org/semapv/vocab/StructuralMatching
Description

Structural matching does not involve looking at "values" of properties.

Example
A match between a subject and an object was established because of the similarity of their structural features, e.g., the number of direct property of a class.
Source https://doi.org/10.1016/j.websem.2009.11.002
Sub Class Of matching process c
Super Class Of bounded path matching c

substring similarity c

IRI https://w3id.org/semapv/vocab/SubstringSimilarity
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of string-based similarity measure c

substring test c

IRI https://w3id.org/semapv/vocab/SubstringTest
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of string-based similarity measure c

term extraction c

IRI https://w3id.org/semapv/vocab/TermExtraction
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

threshold filtering c

IRI https://w3id.org/semapv/vocab/ThresholdFiltering
Sub Class Of matching process post-processing c

token-based distance c

IRI https://w3id.org/semapv/vocab/TokenBasedDistance
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of string-based similarity measure c

tokenization c

IRI https://w3id.org/semapv/vocab/Tokenization
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of normalization c

transformer-based matching process c

IRI https://w3id.org/semapv/vocab/TransformerBasedMatching
Description
  • Transformer models are a deep learning architecture designed to handle sequential data, particularly for natural language processing tasks.

  • An embedding-based matching process that uses transformer models to produce contextual vector representations of entity features for similarity comparison.

Example
BERTMap fine-tunes BERT on synonym pairs derived from the input ontologies and matches classes whose fine-tuned contextual embeddings have high cosine similarity.
Source https://doi.org/10.48550/arXiv.2112.02682
Sub Class Of embedding-based matching process c

unspecified matching process c

IRI https://w3id.org/semapv/vocab/UnspecifiedMatching
Description

A matching process based on an unspecified comparison.

Example
A mapping between a subject and an object was established, but it is no longer clear how or why.
Sub Class Of matching process c

ngram similarity c

IRI https://w3id.org/semapv/vocab/nGramSimilarity
Source https://doi.org/10.1007/978-3-642-38721-0
Sub Class Of string-based similarity measure c

Annotation Properties

description ap

IRI http://purl.org/dc/terms/description

license ap

IRI http://purl.org/dc/terms/license

source ap

IRI http://purl.org/dc/terms/source

title ap

IRI http://purl.org/dc/terms/title

alt label ap

IRI http://www.w3.org/2004/02/skos/core#altLabel

broad match ap

IRI http://www.w3.org/2004/02/skos/core#broadMatch
Is Defined By skos:
Description

A match where the object is a broader concept than the subject.

Source https://www.w3.org/TR/skos-reference/
Sub Property Of non-isomorphic match ap
Super Property Of cross-species broad match ap

close match ap

IRI http://www.w3.org/2004/02/skos/core#closeMatch
Is Defined By skos:
Description

A match where the subject and object are sufficiently similar that they can be used interchangeably in some information retrieval applications.

Source https://www.w3.org/TR/skos-reference/
Sub Property Of is in mapping relation with ap
Super Property Of

definition ap

IRI http://www.w3.org/2004/02/skos/core#definition

exact match ap

IRI http://www.w3.org/2004/02/skos/core#exactMatch
Is Defined By skos:
Description

A match where the subject and object are sufficiently similar, with a high degree of confidence, that they can be used interchangeably across a wide range of information retrieval applications.

Source https://www.w3.org/TR/skos-reference/
Sub Property Of

example ap

IRI http://www.w3.org/2004/02/skos/core#example

mapping relation ap

IRI http://www.w3.org/2004/02/skos/core#mappingRelation
Is Defined By skos:
Description

A mapping (alignment) link between two concepts.

Source https://www.w3.org/TR/skos-reference/
Super Property Of

narrow match ap

IRI http://www.w3.org/2004/02/skos/core#narrowMatch
Is Defined By skos:
Description

A match where the object is a narrower concept that the subject.

Source https://www.w3.org/TR/skos-reference/
Sub Property Of non-isomorphic match ap
Super Property Of cross-species narrow match ap

pref label ap

IRI http://www.w3.org/2004/02/skos/core#prefLabel

related match ap

IRI http://www.w3.org/2004/02/skos/core#relatedMatch
Is Defined By skos:
Description

A match where the subject and the object are inherently related, but where none is in any way more general than the other.

Source https://www.w3.org/TR/skos-reference/
Sub Property Of non-isomorphic match ap
Super Property Of cross-species exact match ap

cross-species broad match ap

IRI https://w3id.org/semapv/vocab/crossSpeciesBroadMatch
Description

A match where the subject is considered analogous to the subject in a different taxonomic grouping, but the object refers to a broader concept.

Example
FBbt:00007091 (D. melanogaster “subperineurial glial sheath”) and UBERON:0000202 (taxon-neutral “glial blood brain barrier”) are a cross-species broad match.
Source https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
Sub Property Of has broader match ap

cross-species close match ap

IRI https://w3id.org/semapv/vocab/crossSpeciesCloseMatch
Description

A match where the subject and the object belong in different taxonomic groupings, but refer to concepts similar enough that they can be used interchangeably.

Source https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
Sub Property Of

cross-species exact match ap

IRI https://w3id.org/semapv/vocab/crossSpeciesExactMatch
Description

A match where the subject is considered analogous to the object in a different taxonomic grouping.

Example
FBbt:00005074 (D. melanogaster “muscle cell”) and CL:0000187 (taxon-neutral “muscle cell”) are a cross-species exact match.
Source https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
Sub Property Of

cross-species narrow match ap

IRI https://w3id.org/semapv/vocab/crossSpeciesNarrowMatch
Description

A match where the subject is considered analogous to the subject in a different taxonomic grouping, but the object refers to a narrower concept.

Source https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
Sub Property Of has narrower match ap

isomorphic match ap

IRI https://w3id.org/semapv/vocab/isomorphicMatch
Description

A match where the subject is isomorphic to the object, i.e. considered of identical or similar form, shape, or structure.

Source https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
Sub Property Of is in mapping relation with ap
Super Property Of

non-isomorphic match ap

IRI https://w3id.org/semapv/vocab/nonIsomorphicMatch
Description

A match where the subject cannot be considered isomorphic to the object, i.e. considered of identical or similar form, shape, or structure. The object corresponds to exactly one subject in the subject_source.

Source https://github.com/mapping-commons/semantic-mapping-vocabulary/issues/3
Sub Property Of is in mapping relation with ap
Super Property Of

Namespaces

dcterms
http://purl.org/dc/terms/
doi
https://doi.org/
ns1
http://w3id.org/semapv/vocab/
owl
http://www.w3.org/2002/07/owl#
rdf
http://www.w3.org/1999/02/22-rdf-syntax-ns#
rdfs
http://www.w3.org/2000/01/rdf-schema#
semapv
https://w3id.org/semapv/
semapv_voc
https://w3id.org/semapv/vocab/
skos
http://www.w3.org/2004/02/skos/core#

Legend

c Classes
ap Annotation Properties

made by p y LODE 3.4.4 with the OntPub profile

Table of Contents