يعرض 1 - 10 نتائج من 46 نتيجة بحث عن '"Habich A"', وقت الاستعلام: 1.58s تنقيح النتائج
  1. 1

    المصدر: Datenbank-Spektrum. 19:183-197

    الوصف: In-memory column-store database systems are state of the art for the efficient processing of analytical workloads. In these systems, data compression as well as vectorization play an important role. Currently, the vectorized processing is done using regular SIMD (Single Instruction Multiple Data) extensions of modern processors. For example, Intel’s latest SIMD extension supports 512-bit vector registers which allows the parallel processing of 8× 64-bit values. From a database system perspective, this vectorization technique is not only very interesting for compression and decompression to reduce the computational overhead, but also for all database operators like joins, scan, as well as groupings. In contrast to these SIMD extensions, NEC Corporation has recently introduced a novel pure vector engine (supercomputer) as a co-processor called SX-Aurora TSUBASA. This vector engine features a vector length of 16.384 bits with the world’s highest bandwidth of up to 1.2 TB/s, which perfectly fits to data-intensive applications like in-memory database systems. Therefore, we describe the unique architecture and properties of this novel vector engine in this paper. Moreover, we present selected in-memory column-store-specific evaluation results to show the benefits of this vector engine compared to regular SIMD extensions. Finally, we conclude the paper with an outlook on our ongoing research activities in this direction.

  2. 2

    المصدر: GRADES-NDA@SIGMOD

    الوصف: Analyzing connected data in forms of graphs is more relevant than ever. To allow users to write their own custom graph algorithms, graph computation models such as GraphBLAS have been developed. Unfortunately, the popular Java programming language was mostly neglected by existing GraphBLAS implementations so far. To overcome that issue, we present our implementation of essential GraphBLAS concepts in the Java programming language in this paper. For our purpose, we extended the linear algebra library Efficient Java Matrix Library (EJML). To show the benefits of our implementation, we compare us against existing graph algorithm libraries in Java using real world graphs and three graph algorithms.

  3. 3

    المصدر: DaMoN

    الوصف: Data-level parallelism (DLP) is a heavily used hardware-driven parallelization technique to optimize the analytical query processing, especially in in-memory column stores. This kind of parallelism is characterized by executing essentially the same operation on different data elements simultaneously. Besides Single Instruction Multiple Data (SIMD) extensions on common x86-processors, GPUs also provide DLP but with a different execution model called Single Instruction Multiple Threads (SIMT), where multiple scalar threads are executed in a SIMD manner. Unfortunately, a complete GPU-specific implementation of all query operators has to be set up, since the state of the vectorized implementations cannot be ported from x86-processors to GPUs right now. To avoid this implementation effort, we present our vision to virtualize GPUs as virtual vector engines with software-defined SIMD instructions and to specialize hardware-oblivious vectorized operators to GPUs using our Template Vector Library (TVL) in this paper.

  4. 4

    المصدر: ICDE Workshops

    الوصف: NEC Corporation offers a vector engine as a specialized co-processor having two unique features. On the one hand, it operates on vector registers multiple times wider than those of recent mainstream x86-processors. On the other hand, this accelerator provides a memory bandwidth of up to 1.2TB/s for 48GB of main memory. Both features are interesting for analytical query processing: First, vectorization based on the Single Instruction Multiple Data (SIMD) paradigm is a state-of-the-art technique to improve the query performance on x86-processors. Thus, for this accelerator we are able to use the same programming, processing, and optimization concepts as for the host x86-processor. Second, this vector engine is an optimal platform for investigating the efficient vector processing on wide vector registers. To achieve that, we describe an approach to master this co-processor for analytical query processing using a column-store specific abstraction layer for vectorization in this paper. We also detail on selected evaluation results to show the benefits and shortcomings of our approach as well as of the coprocessor compared to x86-processors. We conclude the paper with a discussion on interesting future research activities.

  5. 5

    المصدر: The VLDB Journal. 29:775-795

    الوصف: The long-awaited nonvolatile random-access memory technology NVRAM is finally publicly available on the market and requires significant changes to the architecture of in-memory database systems. Since such hybrid DRAM–NVRAM database systems may be able to keep the primary data solely persistent in the NVRAM, efficient replication mechanisms need to be considered to prevent base data losses and to guarantee high availability in case of various persistent memory failures. In this article, we argue for a software-based replication approach and present compute node-local mechanisms to provide the building blocks—generally available for most platforms—for an efficient NVRAM replication with a low latency and minimal throughput penalty. Within our evaluation, based on both real NVRAM hardware and DRAM-backed emulation, we measured up to 10$$\times $$ less overhead for our optimized replication mechanisms compared to the basic replication mechanism of the Intel persistent memory development kit PMDK. Finally, we present a lightweight switching approach for enabling the adaptive online selection of the best replication mechanism for a given situation.

  6. 6

    المصدر: Journal of Cleaner Production. 205:188-200

    الوصف: The adoption of electric vehicles is key to lowering the consumption of fossil fuels and emission of greenhouse gases. Cross-national surveys studying citizens' purchase intentions regarding electric vehicles (EVs) remain limited, especially when it comes to combining individual micro-level factors and contextual macro-level forces. Based on a cross-national dataset with 2806 respondents from China (n = 1078), Brazil (n = 929), and Russia (n = 799), this study analyzes variations and determinants of purchase intentions for EVs in these three countries. The survey results indicate that purchase intentions for EVs among Chinese citizens is higher than amongst Brazilian and Russian citizens. The purchasing intention of citizens in all three countries is especially high for people who have a wide social network, and if they already know somebody with an EV. Other macro-level factors, including pollution and charging infrastructure, only impact on purchasing intention in Brazil, while government policy initiatives for EVs seem to have limited effects in all three countries. Micro-level factors, such as age and education, do not have any statistically significant effect in Russia and Brazil, and only a weak effect in China. Based on these results, we provide recommendations for business and policy makers who need to anticipate citizens' demand for EVs and design policies suitable to accelerate the adoption of sustainable transport solutions.

  7. 7

    المصدر: DaMoN

    الوصف: Query execution techniques constantly adapt to novel hardware features to achieve high query performance, in particular for analytical queries. In recent years, vectorization based on the Single Instruction Multiple Data (SIMD) parallel paradigm has been established as a state-of-the-art approach to increase single-query performance. However, since concurrent analytical queries are executed independently potentially invoking a set of fully vectorized operators, the same data accesses and computations among different queries may be executed redundantly. Various techniques have already been proposed to avoid such redundancy, ranging from concurrent scans via the construction of materialized views to applying multiple query optimization techniques. Continuing this line of research, we now investigate the opportunity of sharing vector registers for concurrently running queries in analytical scenarios. In particular, our core sharing approach is to process data elements of different queries together within a single vector register. As we are going to show, sharing vector registers to optimize the execution of concurrent queries can be very beneficial in many cases. We therefore demonstrate the feasibility of a new work sharing strategy and thus open up a wide spectrum of future research opportunities.

  8. 8

    الوصف: In this paper, we present MorphStore, an open-source in-memory columnar analytical query engine with a novel holistic compression-enabled processing model. Basically, compression using lightweight integer compression algorithms already plays an important role in existing in-memory column-store database systems, but mainly for base data. In particular, during query processing, these systems only keep the data compressed until an operator cannot process the compressed data directly, whereupon the data is decompressed, but not recompressed. Thus, the full potential of compression during query processing is not exploited. To overcome that, we developed a novel compression-enabled processing model as presented in this paper. As we are going to show, the continuous usage of compression for all base data and all intermediates is very beneficial to reduce the overall memory footprint as well as to improve the query performance.
    Submitted to PVLDB

  9. 9

    المصدر: Communications in Computer and Information Science ISBN: 9783030611323
    SFDI/LSGDA@VLDB

    الوصف: Graph-structured data can be found in nearly every aspect of today’s world which contributes to an increasing importance of this data structure for storing and processing data. From a processing perspective, finding comprehensive patterns in graph-structured data is a processing primitive in a variety of applications, such as fraud detection, biological engineering or social graph analytics. On the hardware side, multiprocessor systems—consisting of multiple processors in a single scale-up server—are the next important wave on top of multi-core systems. In particular, symmetric multiprocessor systems (SMP) are characterized by the fact, that each processor has the same architecture, e.g., every processor is a multi-core and all multiprocessors share a common and huge main memory space. Moreover, large SMPs will feature a non-uniform memory access (NUMA), whose impact on the design of efficient data processing concepts is considerable. In this paper, we give an overview of NeMeSys, our system for scalable near-memory graph pattern matching (GPM) on SMPs. NeMeSys is built on a synthesis of well-known concepts of database systems including a set of graph-tailored and hardware-oriented optimization techniques for scalable GPM on SMPs.

  10. 10

    المصدر: IEEE BigData

    الوصف: Storing and processing data at different locations using a heterogeneous set of formats and data managements systems is state-of-the-art in many organizations. However, data analyses can often provide better insight when data from several sources is integrated into a combined perspective. In this paper we present an overview of our data integration system DataCalc. DataCalc is an extensible integration platform that executes adhoc analytical queries on a set of heterogeneous data processors. Our novel platform uses an expressive function shipping interface that promotes local computation and reduces data movement between processors. In this paper, we provide a discussion of the overall architecture and the main components of DataCalc. Moreover, we discuss the cost of integrating additional processors and evaluate the overall performance of the platform.