Wikipedia

Search results

30 June 2013

Big Data Analysis

Big Data Analysis

There's a fantastic series made possible by the University of California, Davis Genome Center, and David Coil on how to access, parse, and analyze big data sequences.

How to display content, and basic grep commands

In summary:

      less fileNamePath
    • used to display file
    • grep item fileNamePath
    • searches and displays item at file
    • grep -c item fileNamePath
    • returns number of occurrences item exists at file

How to parse data with grep

In summary:

      grep item originalFilePath > newFilePath
    • create a new file that contains data with the item parsed
    • grep -v item originalFilePath > newFilePath
    • create a new file containing everything except data with the item parsed

How to analyse data

In summary:

    • find occurrences of item of interest
    • find total number of occurences

Please support this awesome series and subscribe to David's channel on Youtube, where you can also view the series, and more, in it's entirety.

No comments:

Post a Comment