I am supposed to read in a file containing many different email addresses and print them out using an array. The problem is I need to eliminate duplicate emails.
I was able to get my try / catch working and print out the email addresses. However, I am not sure how to go about removing the duplicates. I do not have an understanding of hashcode's or how to use a Set
yet. Any assistance would be appreciated.
Here is what I have so far:
import java.util.Scanner;
import java.io.*;
public class Duplicate {
public static void main(String[] args) {
Scanner keyboard = new Scanner(System.in);
System.out.println("Enter file name: ");
String fileName = keyboard.nextLine();
if (fileName.equals("")) {
System.out.println("Error: User did not specify a file name.");
} else {
Scanner inputStream = null;
try {
inputStream = new Scanner(new File(fileName));
} catch (FileNotFoundException e) {
System.out.println("Error: " + fileName + " does not exist.");
System.exit(0);
}
String[] address = new String[100];
int i = 0;
while (inputStream.hasNextLine()) {
String email = inputStream.nextLine();
// System.out.println(email);
address[i] = email;
System.out.println(address[i]);
i++;
}
}
}
}
You can try going through each element in the array, adding it to another one, checking if the 2nd array contains the next item, if it does skip it. Then just replace the 1st array with the 2nd. (
ArrayList
is better in this case though).so something like this:
you can write a function that run on the array and take one email at a time and when ever it find the same address just set it to null. when you're running on the array to print it, make a condition to print the email only if its not null
Learn
Set
. The time it will take you to learn it is less than the time it will take you to code something that doesn't use it.I'll get you started. Replace this:
String[] address = new String[100];
with this:
Set<String> addresses = new HashSet<String>();
And this:
address[i] = email;
with this:
addresses.add(email);
You don't need the
i
anymore.You're done. If you'd like to print everything out:
That pretty much covers it. Want everything to be automatically sorted? Replace the
HashSet
above withTreeSet
. Now go read this excellent tutorial so that next time, you can get it all done faster and on your own.If you want to remove duplicates you can try something like this:
Read them into a
HashSet
instead. This will handle duplicates for you.Will print
1
.Please use below code for remove duplicates in an integer array.