SPARQL

SPARQL (SPARQL Protocol and RDF Query Language) is a declarative query language and protocol for graph database analytics. SPARQL supports many of the same analytic operations as SQL, and it can also be used for semantic analysis, the examination of relationships. This makes it useful for performing analytics on data sets that combine structured and unstructured data. In addition to the kinds of queries typically run against a relational database, SPARQL can be used to support graph-oriented analyses such as friend-of-a-friend (FOAF) relationships, PageRank and shortest path.

SPARQL was conceived and defined by a W3C standards committee to perform analysis on the Semantic Web or a semantic network (knowledge graph). SPARQL takes advantage of the relationship information (semantic layer) that is inherent in the Resource Description Framework (RDF) to gain insights into correlations between objects.

Today, SPARQL is the only semantic query language that is a W3C standard. As such, commercial organizations and governments have standardized on SPARQL as a language and RDF as a data model to build industry models such as the Financial Industry Business Ontology (FIBO) in financial services, Clinical Data Interchange Standards Consortium (CDISC) standards in pharmaceuticals and HL7/FHIR in healthcare.

SPARQL vs. SQL

SPARQL shares many concepts with SQL. For example, in both languages, the analyst would use SELECT statements and WHERE clauses to analyze data, as well as ORDER BY, LIMIT and OFFSET modifiers. However, since RDF graph databases store data in triples using a simple SUBJECT-PREDICATE-OBJECT data model, SPARQL was designed to query data in this model as a way to better analyze the relationships within the data.

Examples of triples include:

  • Franco-IsA-Person
  • Mercedes-IsA-Automobile
  • Franco-Likes-Mercedes

With this simple set of three triples, an analyst could use SPARQL to find all of the people in a database as well as the things they are related to, such as the automobiles they like. Since data in a graph database is conceptually stored as a single set of triples rather than multiple tables of data, explicit JOIN commands are not necessary and are therefore not part of the SPARQL syntax; triple patterns are connected simply by reusing the same variable. Most syntax related to fact and dimension tables is likewise absent.
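
To make the triple model concrete, here is a minimal sketch in plain Python of how a query over the three triples above can be answered by pattern matching. It is not a SPARQL engine; the `match` and `is_var` helpers are hypothetical names introduced for illustration, and the SPARQL query in the comment is only an approximate equivalent with prefixes omitted.

```python
# Triples from the text, stored as (subject, predicate, object) tuples.
TRIPLES = [
    ("Franco", "IsA", "Person"),
    ("Mercedes", "IsA", "Automobile"),
    ("Franco", "Likes", "Mercedes"),
]


def is_var(term):
    """Terms starting with '?' are treated as variables, as in SPARQL."""
    return term.startswith("?")


def match(patterns, triples, binding=None):
    """Yield every variable binding that satisfies all triple patterns."""
    binding = binding or {}
    if not patterns:
        yield binding
        return
    first, rest = patterns[0], patterns[1:]
    for triple in triples:
        candidate = dict(binding)
        for term, value in zip(first, triple):
            if is_var(term):
                # A variable must keep the value it was bound to earlier;
                # this shared binding is what links the patterns together.
                if candidate.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            yield from match(rest, triples, candidate)


# Roughly equivalent SPARQL (prefixes omitted, names illustrative):
#   SELECT ?who ?thing WHERE { ?who :IsA :Person . ?who :Likes ?thing }
query = [("?who", "IsA", "Person"), ("?who", "Likes", "?thing")]
results = list(match(query, TRIPLES))
print(results)
```

The two patterns share the variable `?who`, so only subjects that satisfy both are returned. No JOIN keyword appears anywhere; the shared variable does that work.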

SPARQL use cases

The above specifics allow SPARQL and RDF to be used by data scientists and analysts for a variety of use cases, such as:

Fraud Detection – With SPARQL, an analyst can detect relationship patterns such as multiple people sharing the same IP address but reporting different physical addresses.

Money Laundering – SPARQL is used to semantically identify and understand the intricate relationships between entities and transactions, including the many individuals and organizations involved with those transactions.

Recommendation Engines – SPARQL lets the analyst explore graph relationships between information categories such as customer interests, friends, and purchase history. He or she can then use SPARQL to recommend products to a particular customer or customer segment based on what has been purchased by others with a similar purchase history.

Customer Insight – SPARQL helps you gain new insight into each customer’s likes and dislikes in relation to other customers with similar statistical parameters such as location or demographics.
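
As an illustration of the fraud detection pattern described above, the following sketch flags pairs of people who share an IP address but report different physical addresses. All names, IP addresses, street addresses and predicates (`usesIP`, `livesAt`) are hypothetical, and the sketch assumes one address per person. The SPARQL query in the comment is an approximate equivalent, not a tested query against any particular triplestore.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical triples; people, IPs, and predicates are illustrative only.
TRIPLES = [
    ("alice", "usesIP", "203.0.113.7"),
    ("bob", "usesIP", "203.0.113.7"),
    ("carol", "usesIP", "198.51.100.2"),
    ("alice", "livesAt", "12 Oak St"),
    ("bob", "livesAt", "98 Elm Ave"),
    ("carol", "livesAt", "5 Pine Rd"),
]

# Roughly equivalent SPARQL (prefixes omitted, predicates hypothetical):
#   SELECT ?a ?b ?ip WHERE {
#     ?a :usesIP ?ip ; :livesAt ?addrA .
#     ?b :usesIP ?ip ; :livesAt ?addrB .
#     FILTER (STR(?a) < STR(?b) && ?addrA != ?addrB)
#   }


def shared_ip_different_address(triples):
    """Return (person_a, person_b, ip) for pairs sharing an IP but not an address."""
    # Simplification: assumes each person has at most one reported address.
    address = {s: o for s, p, o in triples if p == "livesAt"}
    people_by_ip = defaultdict(set)
    for s, p, o in triples:
        if p == "usesIP":
            people_by_ip[o].add(s)
    flagged = []
    for ip, people in sorted(people_by_ip.items()):
        # Sorting makes each pair appear once, in a stable order.
        for a, b in combinations(sorted(people), 2):
            if address.get(a) != address.get(b):
                flagged.append((a, b, ip))
    return flagged


flagged = shared_ip_different_address(TRIPLES)
print(flagged)
```

In a real deployment the same pattern would be written as a SPARQL query and run against the graph database; the Python version only shows the relationship being searched for.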

History of SPARQL

SPARQL and RDF came from an idea first publicized by Tim Berners-Lee, who stressed the need for data that exists on the Web to work better together in government, enterprise, and science. Establishing SPARQL and RDF as a standard protocol, language and data model created a foundation for data sharing on the World Wide Web.

This was last updated in May 2019
