- Clone the repository to your local machine using your preferred Git client or via command line.
- Open the project with Android Studio (version Bumblebee or newer recommended).
- Make sure all dependencies are synced properly with the project by checking the
build.gradle
files. - Connect an Android device or set up an emulator through Android Studio's AVD Manager.
- Build the project by selecting 'Build -> Make Project' in the Android Studio menu.
- Run the application by clicking on the 'Run -> Run 'app'' option and select the connected device or an emulator.
- User has an Android device running on Android-Q or above
- It is assumed that the user's device has a camera that is compatible with the CameraX API.
- The user has basic understanding of how to operate an Android application.
- The device has sufficient storage to save snapshots taken during object detection.
- For generating bounding boxes, a dynamic scale-factor along x and y axes depending on view-size and image-size gave best results. Unfortunately, same failed when combining the canvas with original bitmap for saving into device.
As Jetpack Compose is relatively new, there was limited documentation and support available during development. This was particularly challenging when trying to implement advanced camera functionalities. As a result, the existing articles and resources did not cover the steps to implement the project at all and everything had to be developed from scratch. Despite all the requirements being met, inaccuracy in scaling the coordinates supplied to draw the bounding boxes from the TFLite model on the CameraX Preview screen and while saving the image are present. However, I believe this can be addressed by diving deeper into understanding the relation between device screen size, resolution, output-image size and the camera-frame size.
Addressed By:
- Extensive research and experimentation were conducted to understand the nuances of Jetpack Compose.
- Utilized the latest tutorials, articles, and community forums to find solutions for specific issues related to camera integration with Compose.
The RectF coordinates from the TensorFlow Lite model were not scaled to fit different device resolutions, leading to inaccuracies in drawing bounding boxes around the detected objects.
Addressed By:
- A robust debugging process was established to calculate the size of the camera preview screen, and the size of input and output images.
- Although significant improvements were made, a 100% accuracy in scaling across all devices could not be achieved due to the vast number of screen sizes and resolutions.
The RectF coordinates from the TensorFlow Lite model were not scaled to fit different device resolutions, leading to inaccuracies in drawing bounding boxes around the detected objects. However, the same scaling technique applied to address Challenge-2 did not work because the saved image was in a different resolution depending on the resolution of camera. So, a completely new logic had to be developed.
Addressed By:
- A robust debugging process was established to calculate the size of the camera preview screen, and the size of input and output images.
- Implemented a dynamic scaling factor that adjusts the coordinates based on the device's screen resolution.
- Unfortunately, the boxes still do not surround the detected objects when saved. However, significant improvement was seen after applying a dynamic scaling factor.
InstaLens is an advanced object detection mobile application that leverages the power of TensorFlow Lite to identify objects in real-time through the device's camera. It's built for Android and includes several features and customizations that provide users with a seamless and interactive object detection experience.
InstaLens is equipped with a state-of-the-art object detection model, integrated into an Android project. It utilizes TensorFlow Lite for accurate and efficient detection of objects within the camera's viewfinder.
The application supports both the front and back cameras, giving users the flexibility to switch between different perspectives for object detection. This feature enhances the app's versatility in various scenarios.
A user-friendly UI slider is available to adjust the confidence threshold for object detection. This allows users to filter out detections that fall below a certain confidence level. The threshold level is displayed in a percentage format next to the slider for clear visibility.
A counter is prominently displayed at the top-right corner of the screen, showing the number of objects detected in the current frame. This feature helps users to keep track of the detection activities in real time.
With a simple button tap, users can capture and save a snapshot of the current frame, complete with detection boxes and labels. These images can be stored in the device's gallery or a specific folder designated by the app.
InstaLens adds a layer of customization by categorizing detected objects (such as people, vehicles, animals) and assigning unique colors to each category. This not only makes the detections more visually appealing but also helps in quick identification of object types.
The app is optimized to run smoothly, ensuring that object detection is performed in real time without any significant lag, providing users with a fluid and responsive experience.
Supports both Light and Dark mode.
The app is structured around modern Android development practices, utilizing Kotlin for its development. It features:
- A camera setup that allows for rotation, adapting to the user's choice of the front or back camera.
- An ImageAnalysis use case from the CameraX library that enables processing of the camera feed in real time.
- Integration of a TensorFlow Lite model for object detection.
- A custom
CameraFrameAnalyzer
class that handles frame analysis and object detection logic. - Use of Kotlin coroutines for efficient background processing.
- A
DataStore
preference for storing user settings, including the confidence threshold for detections. - UI components such as
OnBoardingPage
andOnBoardingPageIndicator
for an intuitive user experience, introducing the app's features. - An
ImageScalingUtils
object that ensures detected bounding boxes are scaled appropriately to the size of the display. - Material Design components and theming for a polished UI look and feel.
With these features and its careful implementation, InstaLens stands out as a robust and user-friendly object detection app for Android users.