ArangoDB Blog | Performance (2)

Index types and how indexes are used in ArangoDB: Part I

February 22, 2018

Tags: Performance, geo, General, RocksDB

Estimated reading time: 10 minutes

As in other database systems, indexes can be used in ArangoDB to speed up data retrieval queries, sometimes by many orders of magnitude. Getting the indexes set up the right way is essential for good query performance, so this is an important topic that affects most ArangoDB installations.

(more…)

NoSQL Performance Benchmark 2018 – MongoDB, PostgreSQL, OrientDB, Neo4j and ArangoDB

February 14, 2018

Tags: Performance, General, NoSQL, RocksDB

Estimated reading time: 21 minutes

ArangoDB, as a native multi-model database, competes with many single-model storage technologies. When we started the ArangoDB project, one of the key design goals was and still is to at least be competitive with the leading single-model vendors on their home turf. Only then does a native multi-model database make sense. To prove that we are meeting our goals and are competitive, we run and publish occasionally an update to the benchmark series. This time we included MongoDB, PostgreSQL (tabular & JSONB), OrientDB and Neo4j.In this post we will cover the following topics:

(more…)

Performance Impact of Meltdown and Spectre V1 Patches on ArangoDB

February 13, 2018

Tags: Performance, General, RocksDB, testing

Estimated reading time: 4 minutes

To investigate the impact of the Meltdown and Spectre patches on the performance of ArangoDB, we ran benchmark tests with the two storage engines available in ArangoDB (MMFiles & RocksDB). We used the arangobench benchmark and test tool for these tests.

The tests include 10 different test cases with changing test parameters like concurrency, batch requests and asynchronous execution.

(more…)

An Introduction to Geo Indexes and their performance characteristics: Part II

January 18, 2018

Tags: Performance, geo, General

Estimated reading time: 5 minutes

Geo Index Implementation

This section will cover the MMFiles based geo-index. The algorithm is optimized for in-memory accesses and optimal CPU cache utilization. The main goal for our geo queries is to reject as many distant possible result points as fast as possible.

(more…)

Present and Future of ArangoDB Fulltext Index

January 11, 2018

Tags: Performance, General

Estimated reading time: 2 minutes

The ArangoDB Fulltext index allows you to search for text in arbitrary strings. It is a great way to implement things like autocompletion, product searches or many other use-cases which need some form of fulltext search.The Fulltext Index is suitable for you if your use-case can be broken down to:

Full matches of words
Prefix matches of words
You do not need a “ranking” of the matching documents

(more…)

An Introduction to Geo Indexes and their performance characteristics: Part I

January 04, 2018

Tags: Architecture, Performance, geo, General

Estimated reading time: 8 minutes

Starting with the mass-market availability of smartphones and continuing with IoT devices, self-driving cars ever more data is generated with geo information attached to it. Analyzing this data in real-time requires the use of clever indexing data-structures. Geo data in ArangoDB consists of 2 or more dimensions representing (x, y) coordinates on the earth surface. Searching on a single number is essentially a solved problem, but effectively searching on multi-dimensional data can be more difficult as standard indexing techniques cannot be used.There exist a variety of indexing techniques. In..

(more…)

AWS Neptune: A New Vertex in the Graph World — But Where’s the Edge?

December 06, 2017

Tags: Cluster, Performance, Graphs, General

Estimated reading time: 8 minutes

At AWS Re:Invent just a few days ago, Andy Jassy, the CEO of AWS, unveiled their newest database product offerings: AWS Neptune. It’s a fully managed, graph database which is capable of storing RDF and property graphs. It allows developers access to data via SPARQL or java-based TinkerPop Gremlin. As versatile and as good as this may sound, one has to wonder if another graph database will solve a key problem in modern application development and give Amazon an edge over its competition.

(more…)

Performance analysis with pyArango: Part III Measuring possible capacity with usage Scenarios

October 26, 2017

Tags: Community, Performance, how to, General, monitoring, python, testing, Tutorial

Estimated reading time: 14 minutes

So you measured and tuned your system like described in the Part I and Part II of these blog post series. Now you want to get some figures how many end users your system will be able to serve. Therefore you define “scenarios” which will be typical for what your users do. One such a user scenario could i.e. be:

(more…)

Setting up Datacenter to Datacenter Replication in ArangoDB

October 12, 2017

Tags: Cluster, Enterprise, Architecture, Releases, Replication, Performance, how to, General

Estimated reading time: 7 minutes

Please note that this tutorial is valid for the ArangoDB 3.3 milestone 1 version of DC to DC replication!

Interested in trying out ArangoDB? Fire up your cluster in just a few clicks with ArangoDB ArangoGraph: the Cloud Service for ArangoDB. Start your free 14-day trial here

This milestone release contains data-center to data-center replication as an enterprise feature. This is a preview of the upcoming 3.3 release and is not considered production-ready.

In order to prepare for a major disaster, you can setup a backup data center that will take over operations if the primary data center goes..

(more…)

Performance analysis with pyArango: Part II Inspecting transactions

September 28, 2017

Tags: Community, Performance, General, python, Tutorial

Estimated reading time: 3 minutes

Following the previous blog post on performance analysis with pyArango, where we had a look at graphing using statsd for simple queries, we will now dig deeper into inspecting transactions. At first, we split the initialization code and the test code.

(more…)

Why Graph

Products

Solutions

Developers

Customers

Learn

Company

Pricing

Index types and how indexes are used in ArangoDB: Part I

NoSQL Performance Benchmark 2018 – MongoDB, PostgreSQL, OrientDB, Neo4j and ArangoDB

Performance Impact of Meltdown and Spectre V1 Patches on ArangoDB

An Introduction to Geo Indexes and their performance characteristics: Part II

Geo Index Implementation

Present and Future of ArangoDB Fulltext Index

An Introduction to Geo Indexes and their performance characteristics: Part I

AWS Neptune: A New Vertex in the Graph World — But Where’s the Edge?

Performance analysis with pyArango: Part III Measuring possible capacity with usage Scenarios

Setting up Datacenter to Datacenter Replication in ArangoDB

Performance analysis with pyArango: Part II Inspecting transactions

Quick Links

Info

About Us