6 Probe ID conversion

Probes are used to measure the expression level of many genes. After getting micro array data, the first important thing before downstream analysis maybe probe id conversion.

Ensembl database regularly maps probes sets against the latest set of transcript models using their own pipeline.

Here, transProbe utilizes Ensembl up-to-date annotation datasets and could join together with other genekitr functions fluently.

Notice: now we only support human data

6.1 Supported platforms

36 probe platforms include Affymetrix, Agilent, Illumina, Codelink and Phalanx.

DT::datatable(hsapiens_probe_platform, options = list(pageLength = 10))

6.2 Basic usage

transProbe has four arguments:

  • id: probe id
  • transTo: the conversion target type. User could select more than one from “symbol”, “entrez”, “ensembl” or “uniprot”.
  • org: human (default)
  • platform: Probe platform. If not given, program will detect automatically.
data(deg, package = "genekitr")
probes <- deg$probe_id

res <- transProbe(probes, transTo = "ensembl", org = "human")
head(probes)
## [1] "8144866" "8066431" "8022674" "7925531" "8081358" "7979931"
head(res)
## # A tibble: 6 × 2
## # Groups:   probe_id [6]
##   probe_id ensembl        
##   <chr>    <chr>          
## 1 8144866  ENSG00000156006
## 2 8066431  ENSG00000196839
## 3 8022674  ENSG00000170558
## 4 7925531  ENSG00000275199
## 5 8081358  ENSG00000256628
## 6 7979931  ENSG00000133997

6.3 Convert to more than one type

User could define transTo as many types:

res2 <- transProbe(probes, transTo = c("ensembl","symbol"), org = "human")
head(res2)
##       probe_id         ensembl     symbol
## 9023   8144866 ENSG00000156006       NAT2
## 15007  8066431 ENSG00000196839        ADA
## 11595  8022674 ENSG00000170558       CDH2
## 17512  7925531 ENSG00000275199       AKT3
## 17126  8081358 ENSG00000256628 ZBTB11-AS1
## 6218   7979931 ENSG00000133997       MED6

Let’s take the first record as example:[ENSG00000228794](https://www.ensembl.org/id/ENSG00000228794) has 38 transcripts.

When we look at its second transcript ENST00000445118.7, probe 7896759 is listed in AFFY HuGene-1_0-st-v1 database.