
Native language Scala now enabled through transformations

2nd August 2016
Anna Flockett

Dataiku, maker of the all-in-one predictive analytics software platform Dataiku Data Science Studio (DSS), has announced the release of Dataiku DSS 3.1. The release adds new external integrations, an improved user interface, five visual machine learning engines, and the ability to write transformations in Scala, Apache Spark’s native language.

The blending of visual code-free and free-form code-based transformations is one of the main strengths of Dataiku DSS for the prototyping and production of data applications. In addition to Python, R, SQL, Hive, Impala, and Pig, Dataiku DSS 3.1 now enables Apache Spark users to write transformations and interactive notebooks in Scala, bringing the power of Spark's native and most performant language to the data teams using Dataiku DSS.

Compared with Python, Scala is often considered the ‘engineering language’ for developing data science applications. One of its main advantages in an integrated data science production environment is its agility: code can be tested and refactored easily as a data application is being built. Dataiku DSS users who use Spark now have the ability to write transformations and interactive notebooks in Scala when developing data solutions.

Dataiku DSS 3.1 also introduces new visual machine learning engines that allow users to create incredibly powerful predictive applications within a code-free interface. Users of all skill levels can now leverage HPE Vertica machine learning, H2O Sparkling Water, MLlib, Scikit-Learn, and XGBoost directly from within the visual analysis section of Dataiku DSS 3.1 to apply powerful machine learning algorithms to their data science projects without having to write a single line of code.

“With Dataiku DSS 3.1, we continue to bridge the gap between day-to-day analytic needs and the latest cutting-edge data science technologies,” said Florian Douetteau, CEO and co-founder of Dataiku.

“By adding more machine learning engines and enabling development in Scala, we are bringing even more tools to the table. This allows our users to build the best and most dynamic data science applications quickly,” Douetteau added. “All of the new features in this release add to our goal of being a complete, end-to-end platform for the creation, development, and deployment of predictive analytics solutions for any organisation.”

Additional features of DSS 3.1 include:

  • New external databases - Integration with IBM Netezza, SAP HANA, and Google BigQuery.
  • New DSS project home page - Project dependencies are now visible in the user interface. Projects also now have a status (Active, Sandbox, and Archive) and can be tagged and filtered in various ways.
  • Fluid Navigation - A new, fluid way to navigate between items.
  • Better Integration with Tableau - Users can extend Dataiku DSS compatibility by creating custom export formats for datasets, including Tableau .tde files. This allows for better integration with Tableau and other tools.

Dataiku DSS 3.1 enables data teams of all skill levels to develop powerful data analytics solutions using the latest data science techniques and machine learning technologies.

© Copyright 2024 Electronic Specifier