Addressing your goals and needs

Big Data Discovery makes your job faster and easier than traditional analytics tools. This topic discusses your goals and needs in data analysis and shows how Big Data Discovery can address them.

As a data scientist or analyst, you:
  • Solve complex questions using a set of disjointed tools. You start with many dynamic imprecise questions. You often have unpredictable needs for data, visualization, and discovery capabilities. To address them, you rely on open source and custom discovery tools. You use tools in combination with other tools, and often need to reopen the same tools several times.

    In Big Data Discovery, this fragmented workflow between tools is replaced with a single workflow that is natively part of your ecosystem.

  • Need to collaborate. Together with your team, you work with big data from many external and internal sources. Other team members may consume the results of your findings. You also improve and publish insights and prototypes of new offerings.

    In Big Data Discovery, you can create personal projects, or create and share a project with your team.

  • Want to make sense of data. You often need to create and use insights. You do this by collecting, cleaning and analyzing your data.

    Big Data Discovery lets you make sense of data. You use it to collect, join, shape, and analyze your data.

  • Generate ideas and insights. You want to create insights that lead to changes in business, or you want to enhance existing products, services, and operations. You also need to define and create prototypes (or models) for new data-driven products and services.

    In Big Data Discovery, you arrive at insights by using many data visualization techniques. They include charts, statistical plots, maps, pivot tables, summarization bars, tag clouds, timelines, and others. You can save and share results of your discovery using snapshots and bookmarks.

  • Validate, trace back, tweak, and share your hypotheses. You often need to provide new perspectives to address old problems. The path to the solution is usually exploratory and involves:
    • Hypotheses revision. You must explore several hypotheses in parallel. They are often based on many data sets and interim data products.
    • Hypotheses validation. You need to frame and test hypotheses. This requires validation and learning of experimental methods and results done by others.
    • Data recollection and transparency. You would like all stages of the analytical effort to be transparent so that your team can repeat them. You want to be able to recreate analytical workflows created before. You also would like to share your work. This requires linear history of all steps and activities.

    Big Data Discovery helps you by saving your BDD projects, data sets, and transformation scripts. This lets you improve your projects, share them, and apply saved transformation scripts to other data sets.