Showing posts with label Gemini Ai. Show all posts

Sunday, March 30, 2025

I am a Microsoft 365 family user, how much Microsoft AI can I access?

 AI credits are a measure of your AI usage via Copilot in the Edge browser. Every time a question is asked, one AI credit is used up.


How many AI credits do you get?


This is best answered by Microsoft:


  • Users of the free Designer app (non-subscribers) receive 15 credits per month.

  • Microsoft 365 Personal and Family subscribers get 60 AI credits per month, which can be used across various applications, including Word, Excel, PowerPoint, Outlook, Microsoft Forms, OneNote, Designer, Notepad, Photos, and Paint.


  • Copilot Pro subscribers can use AI features in Word, Excel, PowerPoint, Outlook, Microsoft Forms, OneNote, Designer, Notepad, Photos, and Paint without worrying about credit limits.


Note: AI benefits are only available to the subscription owner and cannot be shared with additional Microsoft 365 Family members. 

I have a Microsoft 365 Family account, so I can use about 60 credits per month. That looks like plenty for now, as there are other agents such as Gemini, Grok, Meta AI, and DeepSeek.


Wednesday, January 29, 2025

Does DeepSeek create an image from a text specification?

 Conversing with DeepSeek, I was given to understand that it does not support image creation; at present, it handles text only.

However, it offered to create an image from text using Python and PIL. As I have been using these libraries, I gave it a try.

Here is the text specification that I used to create an image from Gemini AI [https://hodentekhelp.blogspot.com/2025/01/how-do-you-create-image-using-gemini-ai.html].

"Create an icon with the letter H decorated in yellow, with its insides filled with a web of electronic circuits. The background should be black, 24-bit color with 8-bit transparency, and the format in png."

Gemini AI created the following image:


Here is the suggestion from DeepSeek:

```python
from PIL import Image, ImageDraw, ImageFont

# Create a blank image with a black background and transparency
width, height = 256, 256  # Icon size
background_color = (0, 0, 0, 0)  # Black with full transparency
image = Image.new("RGBA", (width, height), background_color)

# Initialize ImageDraw
draw = ImageDraw.Draw(image)

# Draw the letter "H"
font = ImageFont.truetype("arial.ttf", 150)  # Use a system font like Arial
letter_color = (255, 255, 0, 255)  # Yellow
draw.text((75, 50), "H", font=font, fill=letter_color)

# Draw a web of electronic circuits inside the "H"
# This is a simplified representation using lines
circuit_color = (0, 255, 0, 255)  # Green for circuits
draw.line((90, 100, 90, 200), fill=circuit_color, width=2)  # Vertical line
draw.line((90, 150, 160, 150), fill=circuit_color, width=2)  # Horizontal line
draw.line((160, 100, 160, 200), fill=circuit_color, width=2)  # Vertical line
draw.line((100, 120, 150, 120), fill=circuit_color, width=2)  # Horizontal line
draw.line((100, 180, 150, 180), fill=circuit_color, width=2)  # Horizontal line

# Save the image as a PNG file
image.save("icon_H_circuit.png", "PNG")

print("Icon created and saved as 'icon_H_circuit.png'")
```
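One portability caveat for the snippet above: `ImageFont.truetype("arial.ttf", 150)` only works where Arial is on Pillow's font search path (typically Windows). A small fallback sketch, so the script still runs on other systems:

```python
from PIL import ImageFont

def load_font(size=150):
    # Try Arial first (usually present on Windows); if it is not found,
    # fall back to Pillow's built-in default font so the script keeps running.
    try:
        return ImageFont.truetype("arial.ttf", size)
    except OSError:
        return ImageFont.load_default()

font = load_font()
```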
I ran the DeepSeek code in PyCharm with one addition: a call to image.show() after image.save(...), so the newly created icon is displayed immediately.

Here is the image created by DeepSeek:

Well, given the limitations, it did create an H with a minimalist circuit (it decided the color and the artwork).

I really think high-end chips are needed to create more exotic artwork or images. That said, what is the utility of imaginary sceneries with a multitude of colors?

The basic thing AI should deliver is VALUE, first and foremost.



Monday, January 27, 2025

DeepSeek rattles the US AI dominance. Can it continue to rattle?

 The emergence of DeepSeek AI, a powerful Chinese language model, has sent shockwaves through the US AI industry. Developed with a focus on cost-effectiveness, DeepSeek reportedly functions effectively on lower-end hardware, a stark contrast to US models heavily reliant on high-end chips like Nvidia's. This revelation triggered a significant sell-off in Nvidia stock, highlighting the potential disruption to the current AI landscape.   

https://site.financialmodelingprep.com/market-news/nasdaq-futures-plunge-amidst-concerns-over-deepseeks-impact-on-ai-chip-demand

Last night, I downloaded DeepSeek to take a peek and, lo and behold, at first sight it looked as good as Copilot, Gemini AI, and the others I have come across.

Well, what does it lack?

 However, a notable limitation became apparent: DeepSeek lacks robust image generation capabilities. While it can provide code snippets (such as Python with Kivy) to generate images, this approach is less user-friendly and may be hindered by the limitations of lower-end hardware in processing and rendering graphics. In contrast, US models excel not only in creating images but also in seamlessly modifying them based on simple text prompts, a capability clearly beyond low-end chips.

This development necessitates a renewed focus on innovation and optimization within the US AI sector. US developers must prioritize improving model efficiency and exploring alternative hardware solutions to maintain a competitive edge. While DeepSeek presents a significant challenge, it also serves as a valuable catalyst for further advancements in AI technology.

Does Gemini AI create an image from a text prompt?

 Creating an image with Gemini AI is straightforward. Simply provide a textual description of the image you need. While Gemini AI can create and display the image, it does not store it. You can download the image to the Downloads folder on a Windows computer or share it using the built-in share icon.


Here is an example of text description used to create an  image using Gemini AI on Chrome:

Create an icon with the letter H decorated in yellow, with its insides filled with a web of electronic circuits. The background should be black, 24-bit color with 8-bit transparency, and the format should be PNG. 

At first, it did not create a PNG file; perhaps due to size constraints, it used the JPG format instead. Asking again for a PNG file was unsuccessful. Most agents have this habit of repeating what they did, neglecting to refine the result according to the user's wishes; sometimes they stay fixated on a result across a couple of subsequent sessions.

Other AI agents, such as Meta AI on WhatsApp and Copilot, also have the capability to create images from text. Many other AI agents offer similar functionality.

Here is the image created by Gemini AI.

The image more or less follows the description. Small refinements may sometimes lead to a totally different image rather than an iteration on the previous one.

I wanted to check the other parts of the image description. It is possible to check the image file using PIL and Python:

```python
from PIL import Image

def verify_image_properties(image_path):
    """
    Verifies the color and transparency of an image using PIL.

    Args:
        image_path: Path to the image file.

    Returns:
        A tuple containing:
        - True if the center pixel is yellow, False otherwise.
        - True if the image has 24-bit color, False otherwise.
        - True if the image has 8-bit transparency, False otherwise.
    """
    try:
        img = Image.open(image_path)

        # Check color (simplified approximation: sample the center pixel)
        dominant_color = img.getpixel((img.width // 2, img.height // 2))
        is_yellow = (dominant_color[0] > 200) and (dominant_color[1] > 200) and (dominant_color[2] < 50)

        # Check color depth: RGB mode means 8 bits per channel, i.e. 24-bit color
        is_24_bit_color = img.mode == 'RGB'

        # Check transparency: RGBA mode carries an 8-bit alpha channel
        has_8_bit_transparency = img.mode == 'RGBA'

        return is_yellow, is_24_bit_color, has_8_bit_transparency

    except Exception as e:
        print(f"Error processing image: {e}")
        return False, False, False

# Example usage
image_path = r'C:\Users\hoden\PycharmProjects\exploreImage\Images\GeminiG.jpg'  # Replace with the actual path
is_yellow, is_24_bit_color, has_8_bit_transparency = verify_image_properties(image_path)

if is_yellow:
    print("Image is predominantly yellow.")
else:
    print("Image is not predominantly yellow.")

if is_24_bit_color:
    print("Image has 24-bit color.")
else:
    print("Image does not have 24-bit color.")

if has_8_bit_transparency:
    print("Image has 8-bit transparency.")
else:
    print("Image does not have 8-bit transparency.")
```

The code returns the following:

C:\Users\hoden\AppData\Local\Programs\Python\Python312\python.exe C:\Users\hoden\PycharmProjects\exploreImage\Images\VerifyImage.py 

Image is not predominantly yellow.

Image has 24-bit color.

Image does not have 8-bit transparency.

Process finished with exit code 0
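A side note on why the transparency check fails for this file: the downloaded image is a JPG, and the JPEG format has no alpha channel at all, so 8-bit transparency can never survive a save to JPG. A small sketch (file names are made up for illustration):

```python
from PIL import Image

# A tiny RGBA image: yellow at 50% alpha, a stand-in for the icon.
img = Image.new("RGBA", (16, 16), (255, 255, 0, 128))

# PNG preserves the 8-bit alpha channel.
img.save("demo.png")
print(Image.open("demo.png").mode)  # RGBA

# JPEG cannot store alpha; Pillow refuses to save RGBA as JPEG.
try:
    img.save("demo.jpg")
except OSError:
    # Convert to RGB (dropping the alpha channel) before saving as JPEG.
    img.convert("RGB").save("demo.jpg")
print(Image.open("demo.jpg").mode)  # RGB
```

So to satisfy the "8-bit transparency" part of the specification, the agent would have had to honor the PNG request.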

Did the Gemini AI create an image to fit our description?

It is true that the image is not predominantly yellow, though sampling only the pixel at the image center is perhaps the wrong approach.

Other methods may yield better results than the Center Pixel Method used in the code above for detecting the visible yellow:

  • Center Pixel Method: Checks the color of the pixel at the center of the image. It is simple but may not always represent the overall dominant color.

  • Most Common Color: Counts the occurrences of each color and identifies the most frequent one. It can be effective but might not capture the visually dominant color if the image has a lot of background noise.

  • K-Means Clustering: Groups similar colors together and identifies the most visually impactful color. It is more sophisticated and can provide a better representation of the dominant color, but requires more computational resources.
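As a rough sketch of the Most Common Color approach, here is a minimal example; it uses a small synthetic image (mostly yellow with a green stripe) in place of the downloaded Gemini file:

```python
from collections import Counter
from PIL import Image

# Build a 100x100 test image: yellow everywhere, with a green stripe on top
# (a hypothetical stand-in for the downloaded image).
img = Image.new("RGB", (100, 100), (255, 255, 0))
for x in range(100):
    for y in range(10):
        img.putpixel((x, y), (0, 255, 0))

# Most Common Color: count every pixel and take the most frequent one.
counts = Counter(img.getdata())
dominant = counts.most_common(1)[0][0]
print(dominant)  # (255, 255, 0) -- yellow covers 90% of the pixels

# The same rough "is it yellow?" test as before: high red and green, low blue.
r, g, b = dominant
is_yellow = r > 200 and g > 200 and b < 50
print(is_yellow)  # True
```

Applied to the real file, this would replace the single `getpixel` call in `verify_image_properties`, at the cost of scanning every pixel.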