On the approximability of the exemplar adjacency number problem for genomes with gene repetitions

Zhixiang Chen; Bin Fu; Randy Goebel; Guohui Lin; Weitian Tong; Jinhui Xu; Boting Yang; Zhiyu Zhao; Binhai Zhu

doi:10.1016/j.tcs.2014.07.011

On the approximability of the exemplar adjacency number problem for genomes with gene repetitions

Zhixiang Chen, Bin Fu, Randy Goebel, Guohui Lin, Weitian Tong, Jinhui Xu, Boting Yang, Zhiyu Zhao, Binhai Zhu

Research output: Contribution to journal › Article › peer-review

7 Scopus citations

Abstract

In this paper, we apply a measure, exemplar adjacency number, which complements and extends the well-studied breakpoint distance between two permutations, to measure the similarity between two genomes (or in general, between any two sequences drawn from the same alphabet). For two genomes G and H drawn from the same set of n gene families and containing gene repetitions, we consider the corresponding Exemplar Adjacency Number problem (EAN), in which we delete duplicated genes from G and H such that the resultant exemplar genomes (permutations) G and H have the maximum adjacency number. We obtain the following results. First, we prove that the one-sided 2-repetitive EAN problem, i.e., when one of G and H is given exemplar and each gene occurs in the other genome at most twice, can be linearly reduced from the Maximum Independent Set problem. This implies that EAN does not admit any O(n^0.5-ε)-approximation algorithm, for any ε>0, unless P = NP. This hardness result also implies that EAN, parameterized by the optimal solution value, is W[1]-hard. Secondly, we show that the two-sided 2-repetitive EAN problem has an O(n^0.5)-approximation algorithm, which is tight up to a constant factor.

Original language	English (US)
Pages (from-to)	59-65
Number of pages	7
Journal	Theoretical Computer Science
Volume	550
Issue number	C
DOIs	https://doi.org/10.1016/j.tcs.2014.07.011
State	Published - 2014

Keywords

Adjacency
Approximation algorithm
Breakpoint
Genome comparison
NP-hard

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1016/j.tcs.2014.07.011

Cite this

@article{95b07dc21558420d93c34f6f9351619a,

title = "On the approximability of the exemplar adjacency number problem for genomes with gene repetitions",

abstract = "In this paper, we apply a measure, exemplar adjacency number, which complements and extends the well-studied breakpoint distance between two permutations, to measure the similarity between two genomes (or in general, between any two sequences drawn from the same alphabet). For two genomes G and H drawn from the same set of n gene families and containing gene repetitions, we consider the corresponding Exemplar Adjacency Number problem (EAN), in which we delete duplicated genes from G and H such that the resultant exemplar genomes (permutations) G and H have the maximum adjacency number. We obtain the following results. First, we prove that the one-sided 2-repetitive EAN problem, i.e., when one of G and H is given exemplar and each gene occurs in the other genome at most twice, can be linearly reduced from the Maximum Independent Set problem. This implies that EAN does not admit any O(n0.5-ε)-approximation algorithm, for any ε>0, unless P = NP. This hardness result also implies that EAN, parameterized by the optimal solution value, is W[1]-hard. Secondly, we show that the two-sided 2-repetitive EAN problem has an O(n0.5)-approximation algorithm, which is tight up to a constant factor.",

keywords = "Adjacency, Approximation algorithm, Breakpoint, Genome comparison, NP-hard",

author = "Zhixiang Chen and Bin Fu and Randy Goebel and Guohui Lin and Weitian Tong and Jinhui Xu and Boting Yang and Zhiyu Zhao and Binhai Zhu",

note = "Publisher Copyright: {\textcopyright} 2014 Elsevier B.V.",

year = "2014",

doi = "10.1016/j.tcs.2014.07.011",

language = "English (US)",

volume = "550",

pages = "59--65",

journal = "Theoretical Computer Science",

issn = "0304-3975",

publisher = "Elsevier",

number = "C",

}

TY - JOUR

T1 - On the approximability of the exemplar adjacency number problem for genomes with gene repetitions

AU - Chen, Zhixiang

AU - Fu, Bin

AU - Goebel, Randy

AU - Lin, Guohui

AU - Tong, Weitian

AU - Xu, Jinhui

AU - Yang, Boting

AU - Zhao, Zhiyu

AU - Zhu, Binhai

PY - 2014

Y1 - 2014

N2 - In this paper, we apply a measure, exemplar adjacency number, which complements and extends the well-studied breakpoint distance between two permutations, to measure the similarity between two genomes (or in general, between any two sequences drawn from the same alphabet). For two genomes G and H drawn from the same set of n gene families and containing gene repetitions, we consider the corresponding Exemplar Adjacency Number problem (EAN), in which we delete duplicated genes from G and H such that the resultant exemplar genomes (permutations) G and H have the maximum adjacency number. We obtain the following results. First, we prove that the one-sided 2-repetitive EAN problem, i.e., when one of G and H is given exemplar and each gene occurs in the other genome at most twice, can be linearly reduced from the Maximum Independent Set problem. This implies that EAN does not admit any O(n0.5-ε)-approximation algorithm, for any ε>0, unless P = NP. This hardness result also implies that EAN, parameterized by the optimal solution value, is W[1]-hard. Secondly, we show that the two-sided 2-repetitive EAN problem has an O(n0.5)-approximation algorithm, which is tight up to a constant factor.

AB - In this paper, we apply a measure, exemplar adjacency number, which complements and extends the well-studied breakpoint distance between two permutations, to measure the similarity between two genomes (or in general, between any two sequences drawn from the same alphabet). For two genomes G and H drawn from the same set of n gene families and containing gene repetitions, we consider the corresponding Exemplar Adjacency Number problem (EAN), in which we delete duplicated genes from G and H such that the resultant exemplar genomes (permutations) G and H have the maximum adjacency number. We obtain the following results. First, we prove that the one-sided 2-repetitive EAN problem, i.e., when one of G and H is given exemplar and each gene occurs in the other genome at most twice, can be linearly reduced from the Maximum Independent Set problem. This implies that EAN does not admit any O(n0.5-ε)-approximation algorithm, for any ε>0, unless P = NP. This hardness result also implies that EAN, parameterized by the optimal solution value, is W[1]-hard. Secondly, we show that the two-sided 2-repetitive EAN problem has an O(n0.5)-approximation algorithm, which is tight up to a constant factor.

KW - Adjacency

KW - Approximation algorithm

KW - Breakpoint

KW - Genome comparison

KW - NP-hard

UR - http://www.scopus.com/inward/record.url?scp=84926318071&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84926318071&partnerID=8YFLogxK

U2 - 10.1016/j.tcs.2014.07.011

DO - 10.1016/j.tcs.2014.07.011

M3 - Article

AN - SCOPUS:84926318071

SN - 0304-3975

VL - 550

SP - 59

EP - 65

JO - Theoretical Computer Science

JF - Theoretical Computer Science

IS - C

ER -

On the approximability of the exemplar adjacency number problem for genomes with gene repetitions

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this