Sep 15, 2018 Read test data from Excel sheet in Selenium using POI Avinash Mishra September 15, 2018 Parameterization, Selenium WebDriver No Comments In any test design framework, test data management is the essence. Read test data from Excel sheet in Selenium using POI Avinash Mishra September 15, 2018 Parameterization, Selenium WebDriver No Comments In any test design framework, test data management is the essence. The Apache POI in Selenium is a widely used API for selenium data driven testing. It is a POI library written in Java that gives users an API for manipulating Microsoft documents like.xls and.xlsx. Users can easily create, modify and read/write into excel files. POI stands for “Poor Obfuscation Implementation.”.
Excel files (spreadsheets) are widely used by people all over the world for various tasks related to organization, analysis, and storage of tabular data.
Since excel files are so common, we developers often encounter use-cases when we need to read data from an excel file or generate a report in excel format.
In this article, I’ll show you how to read excel files in Java using a very simple yet powerful open source library called Apache POI.
And in the next article, You’ll learn how to create and write to an excel file using Apache POI.
Let’s get started!
Dependencies
First of all, We need to add the required dependencies for including Apache POI in our project. If you use maven, you need to add the following dependencies to your pom.xml
file -
Poi For Xlsx Read In Selenium For Mac Os
Maven
Gradle
If you use gradle then you can add the following to your build.gradle
file
The first dependency poi
is used to work with the old Microsoft’s binary file format for excel. These file formats have .xls
extension.
The second dependency poi-ooxml
is used to work with the newer XML based file format. These file formats have .xlsx
extension.
Sample Excel file that We’ll read
Following is a sample excel file that we’ll read in our code. It is created using Google Sheets and has .xlsx
extension.
Note that, Although the sample file is of the newer XML based file format (.xlsx
). The code that we’ll write will work with both types of file formats - .xls
and .xlsx
Apache POI terminologies
Apache POI excel library revolves around following four key interfaces -
Workbook: A workbook is the high-level representation of a Spreadsheet.
Sheet: A workbook may contain many sheets. The sample excel file that we looked at in the previous section has two sheets -
Employee
andDepartment
Row: As the name suggests, It represents a row in the spreadsheet.
Cell: A cell represents a column in the spreadsheet.
HSSF and XSSF implementations -
Apache POI library consists of two different implementations for all the above interfaces.
HSSF (Horrible SpreadSheet Format): HSSF implementations of POI’s high-level interfaces like
HSSFWorkbook
,HSSFSheet
,HSSFRow
andHSSFCell
are used to work with excel files of the older binary file format -.xls
XSSF (XML SpreadSheet Format): XSSF implementations are used to work with the newer XML based file format -
.xlsx
.
Program to Read an excel file using Apache POI
The following program shows you how to read an excel file using Apache POI. Since we’re not using any file format specific POI classes, the program will work for both types of file formats - .xls
and .xlsx
.
The program shows three different ways of iterating over sheets, rows, and columns in the excel file -
Note that we’re not even using the concrete classes like HSSFWorkbook
and XSSFWorkbook
to create an instance of the Workbook. We’re creating the workbook using a WorkbookFactory
instead. This makes our program format independent and it works for both types of files - .xls
and .xlsx
.
The program shows three different ways to iterate over sheets, rows, and columns. I prefer the Java 8 forEach loop with a lambda expression. You may use whichever method you like.
Note that, I’ve used a DataFormatter
to format and get each cell’s value as String.
Retrieving Cell values by CellType
Instead of using a DataFormatter
to format and get each cell’s value as String regardless of the Cell type, You may check each cell’s type and then retrieve its value using various type-specific methods like this -
You may now call the above method in the main program to print each cell’s value -
Conclusion
That’s all folks! In this article, You learned how to read excel files in Java using Apache POI library. You can find the entire source code on the github repository.
Also, Don’t forget to check out the next article to learn how to create and write to an excel file using Apache POI
Thank you for reading. Until next time!
Java provides different API to Read and Write a File with help of Selenium. In this Post, we will learn how can we read and write from Excel file by using Java IO package,JXL and Apache POI library.If your are using any Automation Framework you must know About Read and Write fro Excel, We have to use This Concept while working in Data Driven Framework .
Step to Read Data from Excel Using Apache POI jar
To Read or Write From Excel,Apache Community provides a very widely used library POI. This library have features to read and write both XLS and XLSX file format of Excel.HSSF implementation is provided by POI library to read 'XLS' Files and XSSF implementation of POI library to read 'XLSX' File ,Lets go to Deep Die .......
- For user who is using Maven Project can copy Dependency from here
- For user Who is using Simple Java Project Can know How To Download Apache POI Jar and Add To JAVA Build Path
Types of Files in MS-Excel
- .XLS File
- .XLSX File
Types of Classes and InterFace in Apache POI
Apache Interface and Classes |
Read Data from Excel File
- Go To Java Project
- Right Click on Java Project
- Create a New Folder with Name DataFromExcel
- Copy Any Blank Excel File from Your PC and Paste Inside Folder 'DataFromExcel'
- Open Excel File
- Enter The Excel Sheet Name 'LoginDataSheet'
- Enter Any Website Login Credential Whom you want To Read
LoginDataSheet |
SheetName |