It's our pleasure to highligh the initiative taken by our data team leader Ahmed Mahran to effectively contribute to the Spark Time Series project, created by Sandy Ryza, a senior data scientist at Cloudera, the leading big data solutions provider.

 

Time Series data has gained an increasing attention in the past few years. To quote Sandy Ryza:

 

Time-series analysis is becoming mainstream across multiple data-rich industries. The new Spark-TS library helps analysts and data scientists focus on business questions, not on building their own algorithms.

 

Find the full story here, where he introduces SparkTS, and accredits our contributor.

 

We are, forever, indebted to the open source community, it enabled us to create wonderful feats. It's our deep belief that we should give back to the community in order to guarantee its health and sustainability. We are proud that we effectively contributed to such great project and we are looking forward to more.

BADR, in partnership with QCRI, has developed and published TweetMogaz, a system that allows Arab users to get the maximum information from the Arabic content on Twitter, on the spot.

Basically, TweetMogaz consumes streams of Arabic tweets from Twitter, classifies them into relevant topics, then present them to the users in a much more intelligent way.

By intelligence we mean that TweetMogaz can understand tweets topics’ context, group tweets based on that, and present these groups (topics) to the user for a better user experience.
TweetMogaz also is the only Arabic events detector. It’s constantly searching the Arabic content for hot, trending tweets, gathers tweets that relate and occur in a certain timeframe, then present the user a solid, homogenous story.

To achieve that feat, a thorough research has been done (and is continuously in improvment) to get the best out of the Arabic content on Twitter. The research areas extend to: Information Retrieval, Natural Language Processing, Machine Learning, Distributed Systems and Big Data.

The first publication out of TweetMogaz is a demo paper: TweetMogaz v2: Identifying News Stories in Social Media, by Eslam Elsawy (BADR), Moamen Mokhtar (BADR) and Walid Magdy (QCRI), it's published in CIKM 2014.

Hortonworks
 

2014, Aug 4th — BADR, today announced that it has joined the Hortonworks Systems Integrator Partner program and will be delivering big data solutions powered by Hortonworks Data Platform (HDP). Hortonworks is the leading contributor to and provider of Apache™ Hadoop®. BADR’s customers can now benefit from easier integration with Hadoop to our range of data engineering and visualizations services.
By joining the Hortonworks Systems Integrator Partner program, BADR will strengthen its ability to implement enterprise-level big data solutions including Hortonworks Data Platform, the industry’s only 100-percent open source Hadoop distribution, explicitly architected, built, and tested for enterprise-grade deployments. BADR’s implementation services enable customers to leverage the power of their data and reveal new aspects of hidden information. With the integration of Hadoop, customers can now scale to new extents in volume, variety and velocity.

Hortonworks Data Platform was built by the core architects, builders and operators of Apache Hadoop and includes all of the necessary components to manage a cluster at scale and uncover business insights from existing and new big data sources. With a YARN-based architecture, HDP enables multiple workloads, applications and processing engines across single clusters with optimal efficiency. A reliable, secure and multi-use enterprise data platform, HDP is an important component of the modern data architecture, helping organizations mine, process and analyze large batches of unstructured data sets to make more informed business decisions.

”We welcome BADR to the Hortonworks Systems Integrator Partner Program and look forward to working with them to provide data-driven applications powered by HDP for their customers,” said John Kreisa, vice president of strategic marketing at Hortonworks.
“BADR’s long-standing experience delivering high-value enterprise solutions creates a natural expansion to add Hadoop integration services as companies of all sizes are adopting Hadoop to support their big data projects.”

About BADR

BADR is an established IT company which has now set its sights on changing the world of big data. Established in 2014, this new branch of BADR is dedicated to providing the most effective and innovative big data tools possible to companies large and small in the Middle East. BADR has the knowledge and experience necessary to make a difference in the big data world, and we use this experience to help our customers on every step of the way.
 

PostgreSQL 9 bytea type problems

January 19th 2011, 4:03 amCategory: Rails 5 comments

While supporting a long-term ROR web application, I got into a weird problem while setting up the environment to start up the app on my platform (Ruby 1.8.7, Rails 2.1.1, PostgreSQL 9 and Ubuntu 10.10). Symptoms of the problem show up when trying the following scenario:

In a development environment, start up the server, open a browser and enter the usual ' localhost:3000 ' URL to start the app showing home page; till now, no problems, but, when making any action (clicking a link or even trying to refresh the home page) causes the browser to generate a '500 Internal Server Error'. A problem in the session you might think, so did we.