Have you ever wondered what it would be like to have your own robotic personal assistant? Say goodbye to mundane, everyday tasks like doing your laundry, or sifting through the pile of email that builds up while you are on vacation. No more dicing onions or organizing all those random files you saved to your desktop. Now you can tell your robot to do it for you so you can spend your time doing what you want to do. Sounds great, right?
But what does this have to do with data analytics, you ask? Well, here at Aunalytics, we have created a “robotic personal assistant” for data scientists−a data science platform called Aunsight.
Why is a special platform necessary? Data scientists want to answer big questions. They want to spend their time finding the relevant indicators and training predictive models to discover interesting insights. However, there are many tasks leading up to these steps−tasks that are complicated, time-consuming, and generally not relished by the vast majority of data scientists.
The obstacles are two-fold. First, working with large sets of data typically requires a vast skill-set. It requires in-depth knowledge of networking, data integration, APIs, and performance computing, to name a few. Secondly, even with this knowledge, there is an added hindrance of overseeing infrastructure, execution, monitoring, and scheduling. That’s a lot of work outside of the fun part−modeling and predicting.
Our team considered using existing analytics platforms that would allow them to automate many of the tasks associated with data science, but discovered that “off-the-shelf” solutions lacked the flexibility needed for work with our diverse range of mid-sized clients.
It would be similar to purchasing a robot that can only do certain pre-programmed tasks. It could put your laundry in the washing machine, but could not fold the clothes. It could delete your emails, but wouldn’t keep those that are actually relevant. It could only slice onions, but not dice them, and it would just plop all those desktop files in the same folder instead of actually organizing them. Suddenly, this robotic personal assistant doesn’t seem all that helpful anymore.
In addition to limited capabilities, the existing platforms were very expensive, which means the cost of using them would be passed along to the clients − a potential roadblock for companies just beginning to explore their data.
The Aunsight Solution
Being the inventive people that they are, our data scientists began developing their own tools to automate processes. These tools were based on open-source assets and allowed our team the flexibility they required to meet the unique needs of our clients. As more and more tools were developed, they realized that had laid the groundwork for an entire data science platform. This data science platform became Aunsight.
Aunsight allows data scientists to write their own code − this flexibility is something that makes Aunsight stand out from other data science platforms. And because it is based on open-source resources, not only is our “robotic personal assistant” programmable; it was created from free parts! This lowers the overall cost to our clients and enables mid-sized companies to afford, explore, and benefit from data analytics.
Aunsight includes many features designed to make data science easier. It was battle-tested and built with the data scientist in mind. Here is a brief overview of how Aunsight is the ultimate data science assistant:
From the beginning of a data analytics project to the end, Aunsight provides tools to make data scientists’ lives easier, every step of the way. Any type of data source can be integrated and aggregated with Aunsight. This flexibility is especially helpful since every company’s data is different. This allows for more data to be aggregated for use in modeling. More data leads to better insights.
Aunsight allows data scientists to create algorithms and build and run workflows. Because flexibility is at its core, it even allows for scripts to be coded in any programming language.
Re-use and schedule
Once workflows are created, they can be re-used, modified, and scheduled. This helps streamline data cleaning and linking as well as the modeling process. Why re-write new code when you can leverage scripts that have been proven to work? The ability to schedule workflows is also a huge asset for data scientists, improving their efficiency and efficacy.
Connecting the data output to visualizations is easy with Aunsight. Data scientists can use the supplied data API to integrate results with a web app in which custom visualizations can be created. Popular visualizations tools like Tableau (a software product) and D3.js (an open-source library) are compatible with Aunsight.
As a data scientist knows, analytics is never truly finished. There will always be new data generated, and the model itself continues to change and improve. Aunsight allows data scientists to easily schedule and monitor maintenance tasks, and is accessible on many types of devices.
Aunsight provides assistance with analytics, every step of the way. Because of this, our data scientists can focus on what they do best − using analytics to discover exciting insights. While it’s not a laundry-folding, email-sorting, onion-dicing, file-organizing robot, Aunsight is the ultimate robotic personal assistant for a data scientist. Data science just got a whole lot easier.