{"id":23,"date":"2017-01-26T17:34:57","date_gmt":"2017-01-26T17:34:57","guid":{"rendered":"http:\/\/www.juansequeda.com\/blog\/?p=23"},"modified":"2017-01-26T21:11:00","modified_gmt":"2017-01-26T21:11:00","slug":"a-data-weekend-in-austin","status":"publish","type":"post","link":"https:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/","title":{"rendered":"A Data Weekend in Austin"},"content":{"rendered":"<p>On the weekend of January 14-15, I attended <a href=\"http:\/\/datadaytexas.com\/\">Data Day Texas<\/a>, <a href=\"http:\/\/graphday.com\/\">Graph Day Texas<\/a> and <a href=\"http:\/\/datadayhealth.com\/\">Data Day Health<\/a>\u00a0in Austin and gave three talks.<\/p>\n<p><em><strong>Do I need a Graph Database<\/strong><\/em>: This talk came out of a Q\/A during a happy hour after a talk I gave at a meetup in Seattle. We were discussing when to use a graph database and which type of graph to use: RDF or Property Graph.<\/p>\n<p>http:\/\/www.slideshare.net\/juansequeda\/do-i-need-a-graph-database<\/p>\n<p><em><strong>Graph Query Languages<\/strong><\/em>: This talk gave an update on the work we have been doing in the Graph Query Language (GQL) task force at the <a href=\"http:\/\/ldbcouncil.org\/\">Linked Data Benchmark Council<\/a> (LDBC). The purpose of the GQL task force is to study query languages specifically for the Property Graph data model because there is a need for a standard syntax and semantics of a query language. One of the main points I was arguing in this talk is the need for a closed language: graphs in, graphs out. One can argue that a reason for the success of relational databases is that the query language is closed (tables in, tables out). With this principle, queries can be composed (i.e. views!). 
This\u00a0talk was well received and generated a lot of interesting discussion, especially since <a href=\"https:\/\/www.linkedin.com\/in\/emileifrem\/\">Emil Eifrem<\/a>, Neo Technology&#8217;s CEO, was in the room. An interesting part of the discussion was whether we are too early for standardization. Emil stated that we need standardization now because their clients are asking for it. I stated that graph databases today\u00a0are where relational databases were in the mid-1980s, so the time is about right to start the discussion. <a href=\"https:\/\/www.linkedin.com\/in\/andrewdonoho\/\">Andrew Donoho<\/a>\u00a0said I was\u00a0too optimistic. He thinks we are in the late 70s and that it is too early. I will be giving this talk next week at the <a href=\"http:\/\/smartdata2017.dataversity.net\/\">Smart Data<\/a> &#8211; <a href=\"http:\/\/graphorum2017.dataversity.net\/\">Graphorum<\/a>\u00a0conference, with some updated material. Special thanks to Marcelo Arenas, Renzo Angles and especially Hannes Voigt for helping me organize these slides.<\/p>\n<p><em><strong>Semantic Search Applied to Healthcare<\/strong><\/em>: In this talk, I introduced\u00a0how we are identifying patients who are in need of Left Ventricular Assist Devices (LVADs) using Ultrawrap, the semantic data virtualization technology developed at <a href=\"http:\/\/capsenta.com\/\">Capsenta<\/a>. This talk presented a use case with the Ohio State University Wexner Medical Center. Patients are being missed through traditional chart pull methods.\u00a0Our approach has resulted in a ~20% increase in detection over the previously known population at OSU, which is a mature institution. This talk will also be given at the\u00a0<a href=\"http:\/\/smartdata2017.dataversity.net\/\">Smart Data<\/a>\u00a0conference.<\/p>\n<p>Main highlights of the\u00a0conference:<\/p>\n<ul>\n<li>Emil Eifrem, CEO of Neo Technology, gave the keynote. 
It was nice\u00a0to learn the use cases where Neo4j is being used: Real-time recommendation, Fraud detection, Network and IT operations, Master Data Management, Graph-Based Search and Identity &amp; Access Management.\u00a0It was not clear why graphs specifically were used, because these are use cases that have been around for a long time and have been addressed using traditional technologies. Emil ended by talking about a &#8220;connected enterprise&#8221;, meaning integrating data across silos using graphs. If you take a look at my <a href=\"http:\/\/www.slideshare.net\/juansequeda\/do-i-need-a-graph-database\">Do I need a graph database talk<\/a>, you will see that I argue for using RDF for data integration, not Property Graphs.<\/li>\n<li><a href=\"https:\/\/www.linkedin.com\/in\/garulli\/\">Luca Garulli<\/a>, the founder and CEO of OrientDB, gave a talk focusing on the need for a\u00a0multi-model database like <a href=\"http:\/\/orientdb.com\/\">OrientDB<\/a>. In his talk, he argued for many features which Neo4j apparently didn&#8217;t support. Not long after, there was a good back-and-forth Twitter discussion between Emil and Luca. <a href=\"https:\/\/twitter.com\/pluradj\/status\/820331622159544321\">Emil was correcting Luca<\/a>. It seems like this talk may need to be updated. An interesting takeaway for me: how do you benchmark a multi-model database?<\/li>\n<li>There were many talks about &#8220;I&#8217;m in relational, how do I get to property graphs&#8221;, all of them at an introductory level. Given that the problem of mapping relational to RDF has been studied very well, this should be a problem that can be addressed quickly and efficiently.<\/li>\n<li>Standards were a big topic, which is one of the reasons my Graph Query Language talk was well received. Neo4j is pushing for <a href=\"http:\/\/www.opencypher.org\/\">OpenCypher<\/a> to become <em>the<\/em> standard, while in fact, one could argue that\u00a0Gremlin is already the de facto standard. 
Before this weekend, I wasn&#8217;t aware of anybody implementing OpenCypher. Apparently there are now 10 OpenCypher implementations, including Bitnine, Oracle and <a href=\"https:\/\/blogs.sap.com\/2016\/12\/01\/graph-processing-with-sap-hana-2\/\">SAP HANA<\/a>.<\/li>\n<li><a href=\"http:\/\/bitnine.net\/\">Bitnine<\/a>: they are implementing a Property Graph DB on top of Postgres and using OpenCypher as the query language. They are NOT translating OpenCypher to SQL. Instead, they are doing the translation to relational algebra internally. I enjoyed the brief discussion with Kisung Kim, Bitnine&#8217;s CTO. Apparently they have already benchmarked with LDBC and did very well. Looking forward to seeing public results.\u00a0<a href=\"https:\/\/github.com\/bitnine-oss\/agens-graph\">Bitnine is open source<\/a>.<\/li>\n<li>Take a look at <a href=\"http:\/\/sql2gremlin.com\">sql2gremlin.com<\/a><\/li>\n<li><a href=\"http:\/\/grakn.ai\">grakn.ai<\/a> looks interesting. <a href=\"https:\/\/www.youtube.com\/watch?v=OeFrudRlXAM&amp;t=553s\">Need to take a closer look<\/a>.<\/li>\n<li><a href=\"https:\/\/www.youtube.com\/watch?v=2F6XuNRiwjA\">Cray extended the LUBM benchmark<\/a> and added a social network for the students.<\/li>\n<li>Property Graphs are what come to mind when people think about graph databases. However, an interesting observation is that the senior folks\u00a0in the room prefer RDF to Property Graphs. We all agreed that RDF is more mature than Property Graph databases.<\/li>\n<li>&#8220;Those who do not learn history are doomed to repeat it.&#8221; It is crucial to understand what has been done in the past in order to not re-invent the wheel. I feel lucky that early on in grad school, my advisor pushed me to read <em>pre-PDF papers<\/em>. 
It was great to meet this weekend with folks like <a href=\"https:\/\/www.linkedin.com\/in\/darrellwoelk\/\">Darrell Woelk<\/a> and <a href=\"https:\/\/www.linkedin.com\/in\/mistynodine\/\">Misty Nodine<\/a> who used to be part of <a href=\"https:\/\/en.wikipedia.org\/wiki\/Microelectronics_and_Computer_Technology_Corporation\">MCC<\/a>. A lot of the technologies we are seeing today have roots going back to MCC. For example, <a href=\"https:\/\/twitter.com\/juansequeda\/status\/820431143921131520\">we discussed how similar graph databases are to object-oriented databases<\/a>. On Twitter, Emil seemed to disagree with me. Nevertheless, we had an interesting Twitter discussion.<\/li>\n<li>Check out <a href=\"http:\/\/janusgraph.org\/\">JanusGraph<\/a>, a graph database which, if I understood correctly, is a fork of <a href=\"https:\/\/github.com\/thinkaurelius\/titan\">Titan<\/a>. Titan\u00a0hasn&#8217;t been updated in over a year because the folks behind it are now at <a href=\"http:\/\/www.datastax.com\/products\/datastax-enterprise-graph\">DataStax<\/a>.<\/li>\n<\/ul>\n<p>Thanks to Lynn Bender and co. for organizing such an awesome event! Can&#8217;t wait for it to happen in Austin next year. Recordings of the talks will start to show up on the\u00a0<a href=\"https:\/\/www.youtube.com\/channel\/UCIxhvE4rZl9tMmiusLBOM9Q\">Global Data Geek YouTube channel<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>On the weekend of January 14-15, I attended Data Day Texas, Graph Day Texas and Data Day Health\u00a0in Austin and gave three talks. Do I need a Graph Database: This talk came out of a Q\/A during a happy hour after a talk I gave at a meetup in Seattle. 
We were discussing when to &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;A Data Weekend in Austin&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[2,1],"tags":[],"class_list":["post-23","post","type-post","status-publish","format-standard","hentry","category-conference-report","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v14.8.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>A Data Weekend in Austin - Juan Sequeda&#039;s Blog<\/title>\n<meta name=\"robots\" content=\"index, follow\" \/>\n<meta name=\"googlebot\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta name=\"bingbot\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"A Data Weekend in Austin - Juan Sequeda&#039;s Blog\" \/>\n<meta property=\"og:description\" content=\"On the weekend of January 14-15, I attended Data Day Texas, Graph Day Texas and Data Day Health\u00a0in Austin and gave three talks. Do I need a Graph Database: This talk came out of a Q\/A during a happy hour after a talk I gave at a meetup in Seattle. 
We were discussing when to &hellip; Continue reading &quot;A Data Weekend in Austin&quot;\" \/>\n<meta property=\"og:url\" content=\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/\" \/>\n<meta property=\"og:site_name\" content=\"Juan Sequeda&#039;s Blog\" \/>\n<meta property=\"article:published_time\" content=\"2017-01-26T17:34:57+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-01-26T21:11:00+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#website\",\"url\":\"https:\/\/www.juansequeda.com\/blog\/\",\"name\":\"Juan Sequeda's Blog\",\"description\":\"Blog about Computer Science, Research, Knowledge Graphs, Semantic Web, Databases, Graphs and Travel!\",\"publisher\":{\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#\/schema\/person\/11d82cc78d011661d7a9ace3be629323\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":\"https:\/\/www.juansequeda.com\/blog\/?s={search_term_string}\",\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/#webpage\",\"url\":\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/\",\"name\":\"A Data Weekend in Austin - Juan Sequeda&#039;s 
Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#website\"},\"datePublished\":\"2017-01-26T17:34:57+00:00\",\"dateModified\":\"2017-01-26T21:11:00+00:00\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/\"]}]},{\"@type\":\"Article\",\"@id\":\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/#article\",\"isPartOf\":{\"@id\":\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/#webpage\"},\"author\":{\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#\/schema\/person\/11d82cc78d011661d7a9ace3be629323\"},\"headline\":\"A Data Weekend in Austin\",\"datePublished\":\"2017-01-26T17:34:57+00:00\",\"dateModified\":\"2017-01-26T21:11:00+00:00\",\"mainEntityOfPage\":{\"@id\":\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/#webpage\"},\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#\/schema\/person\/11d82cc78d011661d7a9ace3be629323\"},\"articleSection\":\"Conference Report\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"http:\/\/www.juansequeda.com\/blog\/2017\/01\/26\/a-data-weekend-in-austin\/#respond\"]}]},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#\/schema\/person\/11d82cc78d011661d7a9ace3be629323\",\"name\":\"Juan\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/bbfba6f29b3794e885f18fed1999917b8ac661fc907db094811b55ec455897d5?s=96&d=mm&r=g\",\"caption\":\"Juan\"},\"logo\":{\"@id\":\"https:\/\/www.juansequeda.com\/blog\/#personlogo\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","_links":{"self":[{"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/posts\/23","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/comments?post=23"}],"version-history":[{"count":15,"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/posts\/23\/revisions"}],"predecessor-version":[{"id":38,"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/posts\/23\/revisions\/38"}],"wp:attachment":[{"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/media?parent=23"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/categories?post=23"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.juansequeda.com\/blog\/wp-json\/wp\/v2\/tags?post=23"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}