I'm looking to provide some summary stats for a data set and I want to list the different types of edge entities and node(vertex) entities in the graph.
For example:
-> In Twitter Social network graph of users and following relationship (Homogeneous graph), there is only one type of vertex entity (user), but in heterogeneous graphs such as ConceptNet data, it will have multiple values.
-> The edge entities can be computed by just counting the different number of predicates I believe using the query :
SELECT DISTINCT (?p AS ?DistinctEdges) { ?s ?p ?o }
But I am not sure how to do so for vertices. The vertex type can be from a subject or object field of the triple and the object in turn can be either a value(literal) or another resource itself.
Please excuse me if I have gone wrong with the vocabulary anywhere. I have just started working on building a semantic web application.
You can use the
UNION
clause to combine two patterns in conjunction with aFILTER
clause using theIsLiteral()
function to omit literals e.g.The
[]
is an anonymous variable because you don't care about the some of the positions on either side of theUNION
so by giving them an anonymous variable we match any value but don't carry those values out in the query.The
FILTER
clause in the RHS of the union is used to filter out objects which are literals. It is not necessary to have this in the LHS because RDF forbids literal subjects so any?vertex
value from the LHS must be a resource i.e. a URI/blank node