clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters

Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html

Yu, Guangchuang, Li-Gen Wang, Yanyan Han, and Qing-Yu He. clusterProfiler: An R Package for Comparing Biological Themes Among Gene Clusters. OMICS: A Journal of Integrative Biology 16, no. 5 (May 2012): 284–287.

Related Posts

Leave a comment ?

14 Comments.

  1. 您好!我在安装Reactome时显示:
    错误于read.dcf(file.path(pkgname, "DESCRIPTION"), c("Package", "Type")) :
    无法打开链结
    此外: 警告信息:
    1: In download.file(url, destfile, method, mode = "wb", ...) :
    下载的长度70724不等于报告的长度265171851
    2: In download.file(url, destfile, method, mode = "wb", ...) :
    下载的长度121849不等于报告的长度124549
    3: In unzip(zipname, exdir = dest) : 从zip文件中抽取1时出了错
    4: In read.dcf(file.path(pkgname, "DESCRIPTION"), c("Package", "Type")) :
    无法打开压缩文件'reactome.db/DESCRIPTION',可能是因为'No such file or directory'

    请问如何解决?谢谢!

    Reply

    ygc China Unknow Browser Unknow Os Reply:

    这是reactome.db的问题,你是否通过biocLite安装?

    Reply

  2. Hello, I'm trying to use clusterProfiler to analyze a dataset but it is returning an error I don't understand:

    GOtest <- groupGO(gene = test[[1]], organism = "yeast", ont = "CC", level = 2, readable = TRUE)
    Error in .testIfKeysAreOfProposedKeytype(x, keys, keytype) :
    None of the keys entered are valid keys for the keytype specified.

    Any help with how to address this error would be greatly appreciated.

    Reply

    ygc China Unknow Browser Unknow Os Reply:

    yeast was supported only in the current devel version (bioc 2.13) and the gene ID should be ORF ID.

    Reply

  3. Hi, thank you for getting back to me. I am using the current devel version of bioc. Could you be more specific about what ORF ID I should use?

    When I use the entrez gene id I get the same error:
    test12 <- groupGO(gene=test12entrez[[1]], organism = "yeast", ont="CC", level = 2, readable = TRUE)
    Error in .testIfKeysAreOfProposedKeytype(x, keys, keytype) :
    None of the keys entered are valid keys for the keytype specified.

    When I use either the SGD id or the systematic ORF id I get this error:
    Warning message:
    In EXTID2NAME(genes, organism = organism) :
    the input geneID is not entrezgeneID, and cannot be mapped

    Thank you again for your help

    Reply

    ygc China Unknow Browser Unknow Os Reply:

    please give me some of your sample ID.

    Reply

    Rachel United States Unknow Browser Unknow Os Reply:

    ORF ID: "YAL015C" "YAL018C" "YAL037W" "YAL047C" "YAL059W" "YAL067C"

    SGD ID: "S000003725" "S000006064" "S000006435" "S000004592" "S000000067" "S000007362"

    entrez gene ID: "13393612" "854834" "853357" "853479" "1466473" "854944"

    Thank you!!

    Reply

    ygc China Unknow Browser Unknow Os Reply:

    set readable=FALSE, the mapping from yeast ORF to SYMBOL is not supported yet and will be supported soon.

    Reply

  4. 你好,我在运行ck<-compareCluster(geneClusters = clslist,fun='enrichKEGG')时出现错误:“The following `from` values were not present in `x`: .id”,而用cg<-compareCluster(geneClusters = clslist,fun='enrichGO')对GO进行分析是没有问题的,请问是不是因为该gene clusters基因过少?

    Reply

    ygc Hong Kong Mozilla Firefox Ubuntu Linux Reply:

    少到什么程度?看着错误信息,应该是完全没有基因被注释到。kegg的数据是比较老的,因为现在收费了,所以有些基因的注释会缺失。

    Reply

    zkwabm China Google Chrome Windows Reply:

    cluster的基因共有130个,分成了8个子类,这些基因放在一起做KEGG富集分析的时候是没有问题的,共富集了3个pathway

    Reply

  5. 刚才回复的内容丢失了,我再重新写一遍吧。
    这130个基因一起做enrichKEGG分析时,是没有问题的,富集了三个pathway;
    分成子类后,我单独对每个子类进行enrichKEGG分析时结果都为NULL;
    用compareCluster分析enrichKEGG时出现第一次所讲的错误,每个子类的基因个数少则八九个,多则二十多个。

    Reply

    ygc Hong Kong Mozilla Firefox Ubuntu Linux Reply:

    那就真是太少了,而且函数默认只检验基因集大于5的pathway,你才8,9个,null也是正常的。

    你可以设置参数minGSSize = 1, 如果也是null,你再把pvalueCutoff和qvalueCutoff设大一点,起码你可以拿到基因到pathway的注释。

    Reply

    zkwabm China Google Chrome Windows Reply:

    谢谢耐心讲解!

    Reply

Leave a Comment


NOTE - You can use these HTML tags and attributes:
<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>