![]() ![]() |
| Guest_ikodan_* |
Dec 21 2012, 08:11 AM
Post
#21
|
|
Guests |
Thank you
|
|
|
|
Dec 22 2012, 11:12 AM
Post
#22
|
|
|
Newbie ![]() Group: Members Posts: 19 Joined: 6-December 12 Member No.: 1,472 |
Thank you web design SEO, for your effort in fixing bugs.
I have succeeded to setup 2 feeds from 7 I have tried. I have few question, just to clear is it scraper limitation or some kind of small bug left. 1. if in RSS I have this form of image tag: QUOTE <media:content url="http://www.site.com/cache/thumbnail/article_large/news/2012/December/image_823553163.jpg"/> how should i setup "image:" parameter on "RSS parser" tab? 2. Can scraper download images from web page, if img tag is in this form: QUOTE <img src="http://www.site.com/thumbnail.php?file=news/2010/september/image_435509317.jpg&size=article_medium" alt="image"> or in this case: QUOTE <img src="resize.aspx?filename=Nikolina/Fotolia_23977439_Subscription_XXL.jpg&width=550" alt=""> 3. Can scraper download img from description field in RSS if img is in format: QUOTE <img src="http://www.site.com/thumbnail.php?file=news/2012/December/bevanda_823553163.jpg&size=summary_large" align="left" alt="Bevanda: BiH je daleko od "Grčkog scenarija"" /> P.s. if you can test, I can send you my XML feeds (urls) by email. |
|
|
|
Dec 24 2012, 07:59 AM
Post
#23
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
sdbit, just test. Here, at bottom in first post is written about warrancy in every website.
-------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Dec 24 2012, 09:28 PM
Post
#24
|
|
|
Newbie ![]() Group: Members Posts: 19 Joined: 6-December 12 Member No.: 1,472 |
OK, I will test myself. I already succeeded to make few feeds import without of problems.
If you can just answer me about my first question (as there is no such case in example): <media:content url="http://www.site.com/cache/thumbnail/article_large/news/2012/December/image_823553163.jpg"/> How to configure "image:" parameter on "RSS parser" tab? and 2. Can scraper download images from web page, if img tag is in this form: <img src="http://www.site.com/thumbnail.php?file=news/2010/september/image_435509317.jpg&size=article_medium" alt="image"> Thanks. To not forget to say to new prospects: THIS IS GREAT PRODUCT. |
|
|
|
Dec 29 2012, 07:53 PM
Post
#25
|
|
|
Newbie ![]() Group: Members Posts: 19 Joined: 6-December 12 Member No.: 1,472 |
OK, I will test myself. I already succeeded to make few feeds import without of problems. If you can just answer me about my first question (as there is no such case in example): <media:content url="http://www.site.com/cache/thumbnail/article_large/news/2012/December/image_823553163.jpg"/> How to configure "image:" parameter on "RSS parser" tab? and 2. Can scraper download images from web page, if img tag is in this form: Код <img src="http://www.site.com/thumbnail.php?file=news/2010/september/image_435509317.jpg&size=article_medium" alt="image"> Thanks. To not forget to say to new prospects: THIS IS GREAT PRODUCT. Hi webdesignseo, Can you pls confirm me that there is a limitation in scraper, that it cant grab image with source like: Код http://www.site.com/thumbnail.php?file=news/2010/september/image_435509317.jpg&size=article_medium Im interested in this as feature request. How much would it cost to make it work with such image src as well? Thanks This post has been edited by Web Design Seo: Dec 30 2012, 12:27 PM |
|
|
|
Mar 15 2013, 02:41 PM
Post
#26
|
|
|
Newbie ![]() Group: Members Posts: 5 Joined: 15-March 13 Member No.: 1,611 |
I'm having a similar problem. I've got K2 > Catch Image set to Yes and no matter if I got Enable image download set to Yes or no, some images are being caught and some - not. I've looked into the imported post text and saw this:
CODE <img width="300px"<hr id="system-readmore" /> src="http://en.piaget.com//pictures/homeKindThumb/1360"><p><strong>...</strong> <a class="rssreadon" rel="nofollow" title="Spirit Awards 2013" href="http://www.piaget.com/events/spirit-awards-2013-ceremony" >Spirit Awards 2013</a></p> Not sure how this could happen. Importing content from this feed: http://en.piaget.com/rss/rss.xml UPDATE: I noticed that this happens when rss item contains only single image and when I have activated - Split introtext: After N chars. Help is highly appreciated! This post has been edited by dave: Mar 15 2013, 06:04 PM |
|
|
|
Mar 16 2013, 06:51 AM
Post
#27
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
Feed and case will be checked in monday
-------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Mar 18 2013, 11:54 AM
Post
#28
|
|
|
Newbie ![]() Group: Members Posts: 5 Joined: 15-March 13 Member No.: 1,611 |
Were you able to look at it? It breaks layout terribly and the whole benefit of scraping in general :|
It keeps dropping these <hr id="system-readmore" /> inside img tags and probably that's what keeps the scraper from recognizing first image in the content in first place. This post has been edited by dave: Mar 18 2013, 12:43 PM |
|
|
|
Mar 18 2013, 02:19 PM
Post
#29
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
This is not bug, component just don't have this function inside. "Split introtext: After N chars" work well only with clear text. I recommend you to use other options in this select.
-------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Mar 18 2013, 02:25 PM
Post
#30
|
|
|
Newbie ![]() Group: Members Posts: 5 Joined: 15-March 13 Member No.: 1,611 |
This is not bug, component just don't have this function inside. "Split introtext: After N chars" work well only with clear text. I recommend you to use other options in this select. But you allow some html tags in there, no? Quite logical to expect the scraper to not insert anything in the middle of another tag. How's this is not a bug? What other option do you recommend? We need to have an excerpt of the full text on listing page and a link to a detailed page with whole text. Do you have any documentation or writeup about all the options? |
|
|
|
Mar 18 2013, 02:35 PM
Post
#31
|
|
|
Newbie ![]() Group: Members Posts: 5 Joined: 15-March 13 Member No.: 1,611 |
Could it at least catch the first image first and only after that drop in that <hr id="system-readmore" /> tag?
|
|
|
|
Mar 19 2013, 09:45 AM
Post
#32
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
Download of images (and import in K2) depends on many things.
Some of these things: - your server load, memory and max execution time. And after import in K2, K2 process pictures to generate thumbnails. - download of images function require really MANY time - 60 or more seconds to finish if are many pictures in feed. - as is described in component function ( https://3dwebdesign.org/forum/joomla-scrape...for-joomla-t698 ) BMP files are NOT supported. - images without extension .png, .jpg, .jpeg and .gif (generated dinamically over php) can not be recognized and imported every time. Are recognized and imported most of times, but not every single case. -------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Mar 19 2013, 10:38 AM
Post
#33
|
|
|
Newbie ![]() Group: Members Posts: 5 Joined: 15-March 13 Member No.: 1,611 |
Download of images (and import in K2) depends on many things. Some of these things: - your server load, memory and max execution time. And after import in K2, K2 process pictures to generate thumbnails. - download of images function require really MANY time - 60 or more seconds to finish if are many pictures in feed. - as is described in component function ( https://3dwebdesign.org/forum/joomla-scrape...for-joomla-t698 ) BMP files are NOT supported. - images without extension .png, .jpg, .jpeg and .gif (generated dinamically over php) can not be recognized and imported every time. Are recognized and imported most of times, but not every single case. The cases you describe are not really relevant to my case. You should simply either check if there's another tag at split point and do not count it. |
|
|
|
Mar 19 2013, 11:23 AM
Post
#34
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
Yes, dave, my previous post is not for you, sorry. Is answer for questions of different user and is just useful to be in this topic.
@Dave, function "Split introtext: After N chars" come from Agggregator Platinum version and is simple function that work well only with clear text. I have already spoken with our programmers previous day, right after your request, but there is no simple way to make it work with html also. Цитат The problem is that img tag can be in other html tag, the other html tag can be inside other tag in html and there is no simple rule. And one more thing: now from all aggregators in the world Joomla Scraper is aggregator with most features, but these features cost server load and can't be used all at once on shared host. Programmers say me that this "potential new feature" will cost more memory and server load and we will go to the critical level. But we don't want this. Цитат Algorithm of scrapper is optimized, lightweight and robust. Can work on every shared host with php 5 and memory over 32 mb. This is the reason to insert inside this component only most useful things, not all things. All others will "convert" component to extension for vps and shared servers only. -------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Mar 29 2013, 10:40 PM
Post
#35
|
|
|
Newbie ![]() Group: Members Posts: 6 Joined: 22-March 13 Member No.: 1,624 |
Web Design Seo, please answer on this question.
QUOTE If you can just answer me about my first question (as there is no such case in example):
<media:content url="http://www.site.com/cache/thumbnail/article_large/news/2012/December/image_823553163.jpg"/> How to configure "image:" parameter on "RSS parser" tab? |
|
|
|
Mar 30 2013, 09:41 AM
Post
#36
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
All examples and config. info are on page of surrent settings in joomla scraper.This is picture:
![]() Edit: There is one more filed (image) in latest version in custom parser. When you configure field picture can be downloaded local and imported in K2 tab image. This function work in Joomla 2.5 and in Joomla 3.0 version.
Reason for edit: picture edited
-------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Apr 1 2013, 06:46 PM
Post
#37
|
|
|
Newbie ![]() Group: Members Posts: 6 Joined: 22-March 13 Member No.: 1,624 |
Please, give us exact answer. What should I put in image field on RSS praser tab if I have this code in desired RSS emission.
CODE <item>
<title>Manchester United target </title> <link>http://www.mirror.co.uk/sport/football/transfer-news/lewandowski-man-united-neymar-chelsea-1795860</link> <description><![CDATA[PLUS: Will Neymar be extending his contract with Santos, Cardiff want ex-England man and so much more...]]></description> <author>Football Spy</author> <pubDate>Mon, 01 Apr 2013 08:48:01 BST</pubDate> <guid>http://www.mirror.co.uk/sport/football/transfer-news/lewandowski-man-united-neymar-chelsea-1795860</guid> <category>Transfer News</category> <media:thumbnail url="http://www.mirror.co.uk/incoming/article806925.ece/ALTERNATES/s98/PaperTalk-806925.jpg" width="96" height="98"/> <media:content url="http://www.mirror.co.uk/incoming/article806925.ece/ALTERNATES/s615/PaperTalk-806925.jpg" type="image/jpeg" width="615" height="410" /> <mir:hascomments>true</mir:hascomments> <mir:commentsID>mirror-1795860</mir:commentsID> </item> |
|
|
|
Apr 2 2013, 07:59 AM
Post
#38
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
You must enter in field "image" this: media:content url.
-------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
Apr 2 2013, 11:15 AM
Post
#39
|
|
|
Newbie ![]() Group: Members Posts: 6 Joined: 22-March 13 Member No.: 1,624 |
It is not working with media:content url. I ve tried media:content.url but it doesnt work too.
|
|
|
|
Apr 2 2013, 02:23 PM
Post
#40
|
|
![]() Web Design Seo ![]() ![]() ![]() ![]() Group: Root Admin Posts: 4,332 Joined: 29-April 09 From: Sofia Member No.: 1 |
Will be tested tomorrow and fixed if we find bug.
-------------------- Правила на форума | Forum Rules | How to receive support. 3D Web Design: Уеб дизайн, Seo оптимизация, Web Site Extensions, Oscommerce Addons, Wordpress plugins and Joomla Extensions. Изработка на уеб сайтове и оптимизация на сайт за търсачки и Seo услуги.
|
|
|
|
![]() ![]() |
Similar Topics
| Topic | Replies | Topic Starter | Views | Last Action | |
|---|---|---|---|---|---|
![]() |
Pinned: Joomla Scraper Can Grab Any Content From Any Website |
86 | Web Design Seo | 590,852 | 8th September 2021 - 07:02 AM Last post by: Web Design Seo |
![]() |
Pinned: Importing Youtube how to import Youtube feeds |
13 | NVC Academy | 74,279 | 22nd July 2020 - 11:28 AM Last post by: Web Design Seo |
![]() |
Pinned: list with new Joomla exploits |
20 | Web Design Seo | 385,815 | 26th September 2018 - 05:07 AM Last post by: Web Design Seo |
![]() |
Pinned: Joomla Pagination Seo Plugin SEO plugin for Joomla Pagination that work in all Joomla |
61 | Web Design Seo | 463,892 | 13th March 2018 - 10:05 AM Last post by: mxcpz |
![]() |
Importing Images That Are In The Rss Feed As A Media File | 1 | Kat | 49,674 | 28th July 2017 - 07:44 PM Last post by: Kat |
![]() |
Pinned: Joomla Scraper Going Open Source No licenses, use scraper on unlimited number of web sites |
0 | Web Design Seo | 352,558 | 8th March 2017 - 07:40 AM Last post by: Web Design Seo |
![]() |
Joomla Ден 2016 Joomla Day 2016 |
1 | Web Design Seo | 310,694 | 31st October 2016 - 10:11 AM Last post by: Web Design Seo |
![]() |
Joomla Post By Email To K2 Extra Fields | 1 | uglykidjoe | 236,947 | 11th February 2016 - 07:45 AM Last post by: Web Design Seo |
![]() |
Pinned: Joomla Scraper Integration With K2 better integration of Joomla Scraper and K2 |
8 | Web Design Seo | 272,126 | 2nd January 2016 - 09:07 AM Last post by: b_goranov |
![]() |
Pinned: Without Support To 4 January 2016 Happy new 2016 Year |
0 | Web Design Seo | 372,667 | 31st December 2015 - 01:01 PM Last post by: Web Design Seo |
|
Lo-Fi Version | Time is now: 1st June 2026 - 09:16 PM |