Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
  • C chipathlon
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 4
    • Issues 4
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • Deployments
    • Deployments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Commits
  • Issue Boards
Collapse sidebar
  • Holland Computing CenterHolland Computing Center
  • chipathlon
  • Wiki
  • meeting notes

meeting notes · Changes

Page history
aknecht2 created page: run times authored Jan 13, 2017 by aknecht2's avatar aknecht2
Hide whitespace changes
Inline Side-by-side
meeting-notes.markdown
View page @ 70349940
2016-02-25
2017-01-13
===========
* Modify ExpressOrtho perl script to confirm to perl style standards.
* Fetch bams or fastqs.
* Specify file accession for control and experiment instead of experiment accession.
* Add IDR for 2 replicates. Wait on response to figure out. (IDR / FDR?)
* MongoDB read only database to clone. MongoDB helper scripts to update database. Need to create own db instance for saving results.
* Don't need to save output files back to the database.
* Generate report log at the end.
* Remove Duplicates yes / no & duplicates.
* Pooled & Pseudo replicates - pooled concatenate, pseudo shuffled (bam -> sam for plaintext). Configurable option.
* Sphinx documentation at the same time.
* Most conditions don't exist, exist for some.
* Collapse Bed & Peak Collections
* Only need one of score or signal_value
* Need to derive read length from downloaded fastq
* Randomly select experiment to control for peak calling, only use possible_controls
* Only need human / mouse
* DFBS keep everything same except one of condition OR cell type. (ignore for now)
* Restrict to database, add genome collection.
2016-03-03
============
* Run on Myc / Max -- wait on correct experiments to use!
* Nothing else!
2016-03-10
==============
* Don't use the assembly from the samples record, use the Grch assembly.
* bowtie2 standard error continues quality measures!
* for now focus on bowtie2
* MXI1
* Add aggregation pipelines to meta to extract relevant transcription factors
2016-03-17
============
* Potentially do comparison paper of no SQL -> sql for bioinformatics. Run similar setup/design/analysis in SQL and see how it compares.
* 3 Papers (Biological paper, Pegasus paper, MongoDB paper)!?
2016-04-05
2016-12-13
============
* Created dummy sample entries to run pipeline faster!
* Look into better downloading tools to increase speed
* Organize files by organism -> cell_type -> tf/hm -> (biorep1, biorep2, idr)
2016-04-12
2016-12-07
=============
* Add gem
* For DFBS: macs2, csaw, jmosaic
* Human: CHD2 for k562 H1-hESC cells
2016-09-22
===============
* macs2 paired end reads need to be run differently check accordingly.
* convert/sort/idr for individual samples.
2016-09-15
=============
......@@ -53,28 +36,47 @@ End Goals:
* Merged replicates in workflow
* Don't need GUI
2016-09-22
===============
* macs2 paired end reads need to be run differently check accordingly.
* convert/sort/idr for individual samples.
2016-12-07
2016-04-12
=============
* Human: CHD2 for k562 H1-hESC cells
* Add gem
* For DFBS: macs2, csaw, jmosaic
2016-12-13
2016-04-05
============
* Organize files by organism -> cell_type -> tf/hm -> (biorep1, biorep2, idr)
* Created dummy sample entries to run pipeline faster!
* Look into better downloading tools to increase speed
2017-01-13
2016-03-17
============
* Potentially do comparison paper of no SQL -> sql for bioinformatics. Run similar setup/design/analysis in SQL and see how it compares.
* 3 Papers (Biological paper, Pegasus paper, MongoDB paper)!?
2016-03-10
==============
* Don't use the assembly from the samples record, use the Grch assembly.
* bowtie2 standard error continues quality measures!
* for now focus on bowtie2
* MXI1
* Add aggregation pipelines to meta to extract relevant transcription factors
2016-03-03
============
* Run on Myc / Max -- wait on correct experiments to use!
* Nothing else!
2016-02-25
===========
* Modify ExpressOrtho perl script to confirm to perl style standards.
* Fetch bams or fastqs.
* Specify file accession for control and experiment instead of experiment accession.
* Add IDR for 2 replicates. Wait on response to figure out. (IDR / FDR?)
* MongoDB read only database to clone. MongoDB helper scripts to update database. Need to create own db instance for saving results.
* Don't need to save output files back to the database.
* Generate report log at the end.
* Remove Duplicates yes / no & duplicates.
* Pooled & Pseudo replicates - pooled concatenate, pseudo shuffled (bam -> sam for plaintext). Configurable option.
* Sphinx documentation at the same time.
* Most conditions don't exist, exist for some.
* Collapse Bed & Peak Collections
* Only need one of score or signal_value
* Need to derive read length from downloaded fastq
* Randomly select experiment to control for peak calling, only use possible_controls
* Only need human / mouse
* DFBS keep everything same except one of condition OR cell type. (ignore for now)
* Restrict to database, add genome collection.
Clone repository
  • H3K36me3_chipseq_experiments
  • Human
    • mouse_genomes_annotations_ENSEMBL_formats
  • TFsHMsMethods
  • TODO
  • experimental design using hg19 and mm9 bam files
  • hg19 and mm9 genome assemblies and annotations
  • Home
  • meeting notes
  • packaging fixes
  • run times