187. Repeated DNA Sequences

作者：

在

All DNA is composed of a series of nucleotides abbreviated as A, C, G, and T, for example: “ACGAATTCCG”. When studying DNA, it is sometimes useful to identify repeated sequences within the DNA.
Write a function to find all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule.
Example:
Input: s = “AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT”
Output: [“AAAAACCCCC”, “CCCCCAAAAA”]
难度：medium
题目：所有的ＤＮＡ是由核甘酸简写字母Ａ，Ｃ，Ｇ和Ｔ组成，例如：ＡＣＧＡＡＴＴＣＣＧ。在研究ＤＮＡ时重复序列有时是非常有用的。写函数找出所有由10个字母组成的且出现过2次及以上的子序列。
思路：hash map 统计子序列。
Runtime: 24 ms, faster than 71.23% of Java online submissions for Repeated DNA Sequences.
class Solution {
public List<String> findRepeatedDnaSequences(String s) {
Map<String, Integer> seqMap = new HashMap<>();
List<String> result = new ArrayList<>();
for (int i = 0; i < s.length() – 9; i++) {
String seq = s.substring(i, i + 10);
Integer count = seqMap.getOrDefault(seq, 0);
if (1 == count.intValue()) {
result.add(seq);
}
seqMap.put(seq, count + 1);
}

return result;
}
}

leetcode 算法

发表回复取消回复

这个站点使用 Akismet 来减少垃圾评论。了解你的评论数据如何被处理。

187. Repeated DNA Sequences

评论

发表回复取消回复

更多文章

苹果iOS打包的ipa应用无法安装？一篇文章带你了解可能的原因及排查方法

图解Golang：从零开始实现简易版过期LRU缓存

深入解析：基于Delta的线性数据结构模型，打造高效富文本编辑器

轻松管理社交媒体：使用Automa插件实现一键拉黑功能

187. Repeated DNA Sequences

评论

发表回复 取消回复

更多文章

苹果iOS打包的ipa应用无法安装？一篇文章带你了解可能的原因及排查方法

图解Golang：从零开始实现简易版过期LRU缓存

深入解析：基于Delta的线性数据结构模型，打造高效富文本编辑器

轻松管理社交媒体：使用Automa插件实现一键拉黑功能

发表回复取消回复