Neo4j-Apoc
2018-12-24 本文已影响0人
点点渔火
APOC
https://neo4j-contrib.github.io/neo4j-apoc-procedures/#_virtual_nodes_rels
提供的函数 存储过程应有尽有, 也可以自行实现添加
CALL apoc.help("dijkstra")
Apoc配置:
apoc.trigger.enabled=false/true : Enable triggers
apoc.ttl.enabled=false/true: Enable time to live background task
apoc.jdbc.<key>.uri=jdbc-url-with-credentials : 配置数据库连接串
apoc.es.<key>.uri=es-url-with-credentials: ES连接
apoc.mongodb.<key>.uri=mongodb-url-with-credentials: mongodb连接
apoc.couchbase.<key>.uri=couchbase-url-with-credentials: couchbase连接
apoc.jobs.scheduled.num_threads=number-of-threads: APOC调度器的线程池
apoc.jobs.pool.num_threads=number-of-threads: 执行器的线程池
有用的函数方法:
解析域名: WITH 'http://www.example.com/all-the-things' AS url RETURN apoc.data.domain(url) // will return 'www.example.com'
日期函数:
apoc.date.parse('2015/03/25 03-15-59',['s'],['yyyy/MM/dd HH/mm/ss'])
apoc.date.add(12345, 'ms', -365, 'd')
格式转换:
return apoc.number.format(12345.67) as value
return apoc.number.format(12345, '#,##0.00;(#,##0.00)', 'it') as value
数学运算:
RETURN apoc.number.exact.add(stringA,stringB)
RETURN apoc.number.exact.sub(stringA,stringB)
ETURN apoc.number.exact.mul(stringA,stringB,[prec],[roundingModel]
RETURN apoc.number.exact.div(stringA,stringB,[prec],[roundingModel])
RETURN apoc.number.exact.toInteger(string,[prec],[roundingMode])
RETURN apoc.number.exact.toFloat(string,[prec],[roundingMode])
RETURN apoc.number.exact.toExact(number)
比较节点不同:
apoc.diff.nodes([leftNode],[rightNode])
图算法:
路径扩展(选择走哪些边, 哪些节点, 几层, 什么时候结束等等):
CALL apoc.path.expand(startNode <id>|Node, relationshipFilter, labelFilter, minLevel, maxLevel )
MATCH (user:User) WHERE user.id = 460
CALL apoc.path.expandConfig(user,{relationshipFilter:"RATED",minLevel:3,maxLevel:3,bfs:false,uniqueness:"NONE"}) YIELD path
RETURN count(*);
apoc.path.subgraphAll(startNode <id>Node/list, {maxLevel, relationshipFilter, labelFilter, bfs:true, filterStartNode:true, limit:-1}) yield nodes, relationships
MATCH (user:User) WHERE user.id = 460
CALL apoc.path.subgraphNodes(user, {}) YIELD node
RETURN node;
Closeness Centrality:
CALL apoc.algo.closeness(['TYPE'],nodes,'INCOMING') YIELD node, score
Betweenness Centrality:
CALL apoc.algo.betweenness(['TYPE'],nodes,'BOTH') YIELD node, score
pageRank:
CALL apoc.algo.pageRank(nodes) YIELD node, score
// only compute over relationships of types TYPE_1 or TYPE_2
CALL apoc.algo.pageRankWithConfig(nodes,{types:'TYPE_1|TYPE_2'}) YIELD node, score
// peroform 10 page rank iterations, computing only over relationships of type TYPE_1
CALL apoc.algo.pageRankWithConfig(nodes,{iterations:10,types:'TYPE_1'}) YIELD node, score
dijkstra:
apoc.algo.dijkstra(startNode, endNode, 'KNOWS|<WORKS_WITH|IS_MANAGER_OF>', 'distance') YIELD path, weight
apoc.algo.dijkstraWithDefaultWeight(startNode, endNode, 'KNOWS|<WORKS_WITH|IS_MANAGER_OF>', 'distance', 10) YIELD path, weight
example:
MATCH (from:Loc{name:'A'}), (to:Loc{name:'D'})
CALL apoc.algo.dijkstra(from, to, 'ROAD', 'd') yield path as path, weight as weight
RETURN path, weight
community: 标签传播的社区发现算法
apoc.algo.community(times,labels,partitionKey,type,direction,weightKey,batchSize)
example: 遍历25轮,
CALL apoc.algo.community(25,null,'partition','X','OUTGOING','weight',10000)
aStar: A*遍历算法
apoc.algo.aStar(startNode, endNode, 'KNOWS|<WORKS_WITH|IS_MANAGER_OF>', 'distance','lat','lon') YIELD path, weight
cliques: 聚类社区算法:
apoc.algo.cliques(minSize) YIELD clique
apoc.algo.cliquesWithNode(startNode, minSize) YIELD clique
各种距离算法:
apoc.algo.cosineSimilarity([vector1], [vector2]) cosin相似度
apoc.algo.euclideanDistance([vector1], [vector2]) 欧几里得距离
apoc.algo.euclideanSimilarity([vector1], [vector2]) 欧几里得相似度
Virtual Nodes/Rels: 虚拟节点, 关系 类似视图的概念
MATCH (a)-[r]->(b)
WITH head(labels(a)) AS l, head(labels(b)) AS l2, type(r) AS rel_type, count(*) as count
CALL apoc.create.vNode([l],{name:l}) yield node as a
CALL apoc.create.vNode([l2],{name:l2}) yield node as b
CALL apoc.create.vRelationship(a,rel_type,{count:count},b) yield rel
RETURN *;
CALL apoc.create.vPattern({_labels:['Person'],name:'Mary'},'KNOWS',{since:2012},{_labels:['Person'],name:'Michael'})
CALL apoc.create.vPattern({_labels:['Person', 'Woman'],name:'Mary'},'KNOWS',{since:2012},{_labels:['Person', 'Man'],name:'Michael'})
CALL apoc.create.vPatternFull(['British','Person'],{name:'James', age:28},'KNOWS',{since:2009},['Swedish','Person'],{name:'Daniel', age:30})
Cypher Exectuion: 批量执行脚本
CALL apoc.cypher.runFiles([files or urls],{config}) yield row, result
runs each statement in the files, all semicolon separated
CALL apoc.cypher.runFile(file or url,{config}) yield row, result
runs each statement in the file, all semicolon separated - currently no schema operations
Manual Index: 手工索引, 默认不会自动更新
synchronously同步更新 只有创建索引时指定autoUpdate:true, 才会真正生效, 更新图时性能上会有影响
apoc.autoIndex.enabled=true
asynchronously异步更新
apoc.autoIndex.async=true
默认的异步更新参数:50000 operations or 5000 milliseconds 触发
apoc.autoIndex.queue_capacity=100000
apoc.autoIndex.async_rollover_opscount=50000
apoc.autoIndex.async_rollover_millis=5000
apoc.autoIndex.tx_handler_stopwatch=false
添加多个节点属性缩影 配合search用, 重复执行会先删除再创建: apoc.index.addAllNodes('index-name',{label1:['prop1',…],…}, {options})
options的选择(map形式给入):
type: fulltext/exact 全文索引/精确索引
to_lower_case: false/true
analyzer: classname 用哪种classname of lucene analyzer
similarity: classname 相似度计算的方式 classname for lucene similarity
autoUpdate: true/false 是否自动更新
添加一个节点的属性索引(可以不存在这个属性字段) apoc.index.addNode(node,['prop1',…])
对于给定的标签,添加节点索引(索引名称的Label和node可以不一致): apoc.index.addNodeByLabel('Label',node,['prop1',…])
给索引命名 默认是Label作为名称: apoc.index.addNodeByName('name',node,['prop1',…])
apoc.index.addNodeMap(node,{key:value})
apoc.index.addNodeMapByName(index, node,{key:value})
关系索引: apoc.index.addRelationship(rel,['prop1',…])
apoc.index.addRelationshipByName('name',rel,['prop1',…])
apoc.index.addRelationshipMap(rel,{key:value})
apoc.index.addRelationshipMapByName(index, rel,{key:value})
删除节点索引 apoc.index.removeNodeByName('name',node) remove node from an index for the given name
索引模糊查询(计算编辑距离) apoc.index.search('index-name', 'query') YIELD node, weight
apoc.index.nodes('Label','prop:value*') YIELD node, weight
apoc.index.relationships('TYPE','prop:value*') YIELD rel, weight
没理解: apoc.index.between(node1,'TYPE',node2,'prop:value*') YIELD rel, weight
示例:
match (p:Person) call apoc.index.addNode(p,["name","age"]) RETURN count(*); // 129s for 1M People
call apoc.index.nodes('Person','name:name100*') YIELD node, weight return * limit 2
Index Management:
CALL apoc.index.remove('Thing')
展示: CALL apoc.index.list()
CALL apoc.index.forNodes('name',{config}) YIELD type,name,config
CALL apoc.index.forRelationships('name',{config}) YIELD type,name,config
Triggers : 触发器
apoc.trigger.enabled=true
Locking: 锁
call apoc.lock.nodes([nodes])
call apoc.lock.rels([relationships])
call apoc.lock.all([nodes],[relationships])
Meta Graph: 展示节点, 关系标签的概览图
CALL apoc.meta.graphSample()
CALL apoc.meta.graph
有选择的展示一部分结果: CALL apoc.meta.subGraph({labels:[labels],rels:[rel-types],excludes:[label,rel-type,…]})
表格形式展示数据: CALL apoc.meta.data
Map形式展示数据: CALL apoc.meta.schema
快速查看图中各种存在的节点,边: CALL apoc.meta.stats yield labelCount, relTypeCount, propertyKeyCount, nodeCount, relCount, labels, relTypes, stats
Sehema: 查看各种索引, 约束
apoc.schema.assert({indexLabel:[indexKeys],…},{constraintLabel:[constraintKeys],…}, dropExisting : true) yield label, key, unique, action
apoc.schema.nodes() yield name, label, properties, status, type
apoc.schema.relationships() yield name, type, properties, status
apoc.schema.node.indexExists(labelName, properties)
apoc.schema.node.constraintExists(labelName, properties)
apoc.schema.relationship.constraintExists(type, properties)
Streaming Data to Gephi: 导出数据到Gephi
apoc.gephi.add(url-or-key, workspace, data, weightproperty, ['exportproperty'])