We would like to welcome Chuanhow Technologies as our newest partner

Chuanhow Technologies is a specialist in cloud computing/big data, software defined networking, network security, data analysis, data loss prevention, application delivery management, business continuity, enterprise software, e-commerce, mobile computing, asset management, social media and an innovation strategy for next generation products. Chuanhow helps vendors in the IT market develop a profitable business, provide a distinctive pathway for development, a new way of thinking to help reseller partners and enterprise customers reduce costs, optimize operations, improve efficiency, and build a profitable portfolio.

We are proud to announce that Everis has become our partner

Everis is a multinational consulting firm providing business and strategy solutions, application development, maintenance, and outsourcing services. Established in 1996, everis has averaged 20% annual growth in revenues and became part of NTT Data in January, 2014.
Being part of the NTT Data group enables everis to offer a wider range of solutions and services through increased capacity as well as technological, geographical, and financial resources

Top-k queries in Cassandra: An embedded mapreduce approach

Stratio has just added top-k queries support to its Lucene based implementation of the Cassandra’s secondary indexes. This implementation was originally designed to allow embedded full-text and multivariable search in Apache Cassandra. The previous release included an ad-hoc mechanism to perform distributed relevance queries based on the Lucene’s scoring algorithm. The current release generalizes this mechanism to allow several types of top-k queries.

Spark-MongoDB library

Once Data Sources API  has been released, we’ve wanted to take advantage of these new features and, for this reason, we have developed a Spark-MongoDB library. With this new connector we help the growing MongoDB community to simplify the interaction with this datasource via Spark.

This library provides the mechanism for accessing MongoDB collections in a structured way from SparkSQL, accesible from Python and Scala API’s. Since MongoDB is an open-source document database leader among NoSQL databases and is highly used in several projects [http://www.mongodb.com/leading-nosql-database] we find this connection with all the operations permitted by SparkSQL not only useful but necessary.